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TITLE OF THE INVENTION 

MELANIN CONCENTRATING HORMONE RECEPTOR CHIMERIC AND 
FUSION PROTEINS 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims priority to provisional application U.S. 
Serial No. 60/189,698, filed March 15, 2000, hereby incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

10 The references cited herein are not admitted to be prior art to the 

claimed invention. 

Neuropeptides present in the hypothalamus play a major role in 
mediating the control of body weight. (Flier et al, 1998. Cell, 92, 437-440.) Melanin- 
concentrating hormone (MCH) is a cyclic 19-amino acid neuropeptide synthesized as 

15 part of a larger pre-prohormone precursor in the hypothalamus which also encodes 
neuropeptides NEI and NGE. (Nahon et al, 1990. Mol Endocrinol 4, 632-637.) . 
MCH was first identified in salmon pituitary, and in fish MCH affects melanin 
aggregation thus affecting skin pigmentation. In trout and in eels MCH has also been 
shown to be involved in stress induced or CRF-stimulated ACTH release. (Kawauchi 

20 et al, 19S3. Nature 305, 321-323.) 

In humans two genes encoding MCH have been identified that are 
expressed in the brain. (Breton et al, 1993. Mol Brain Res. 18, 297-310.) In 
mammals MCH has been localized primarily to neuronal cell bodies of the 
hypothalamus which are implicated in the control of food intake, including perikarya 

25 of the lateral hypothalamus and zona inertia. (Knigge et al, 1996. Peptides 17, 1063- 
1073.) 

Pharmacological and genetic evidence suggest that the primary mode 
of MCH action is to promote feeding (orexigenic). MCH mRNA is up regulated in 
fasted mice and rats and in the ob/ob mouse. (Qu et al, 1996. Nature 380, 243-247.) 
30 Injection of MCH centrally (ICV) stimulates food intake and MCH antagonizes the 
hypophagic effects seen with a-melanocyte stimulating hormone (ocMSH). (Qu et 
al, 1996. Nature 380, 243-247.) MCH-deficient mice are lean, hypophagic, and have 
increased metabolic rate. (Shimada et al, 199S. Nature 396, 670-673.) 

MCH action is not limited to modulation of food intake as effects on 
35 the hypothalamic-pituitary-axis have been reported. (Nahon 1994. Critical Rev. in 
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Neurobiol S, 221-262.) MCH may be involved in the body response to stress as 
MCH can modulate the stress-induced release of CRF from the hypothalamus and 
ACTH from the pituitary. In addition, MCH neuronal systems may be involved in 
reproductive or maternal function. 
5 Several references describe a receptor that is indicated to bind MCH. 

(Chambers et al, 1999. Nature 400, 261-265; Saito et al, 1999. Nature 400, 265-269; 
Bachner et al, 1999. FEBS Letters 457:522-524; Shimomura et al, 1999. 
Biochemical and Biophysical Research Communications 261, 622-626; and Lembo et 
al, 1999. Nat. Cell Biol 1, 267-271.) 

10 

SUMMARY OF THE INVENTION 

The present invention features melanin concentrating hormone receptor 

(MCH-R) chimeric and fusion proteins. MCH-R chimeric proteins comprise an 

MCH-R polypeptide region made up of at least two or more polypeptide regions 
15 characteristic of MCH-R found in different species. MCH-R fusion proteins comprise 

an MCH-R polypeptide region and a fluorescent protein region. 

An MCH-R polypeptide region provides a functional G-protein 

coupled receptor region able to bind MCH and transduce an intracellular signal. 

Examples of MCH-R polypeptide regions include naturally occurring MCH-R, 
20 chimeric MCH-R containing two or more regions from naturally occurring MCH-R, 

and functional derivatives thereof. 

Reference to the terms "characteristic" and "derivatives thereof 

describe a relationship to a reference sequence. In both cases, there is at least about 

75% sequence similarity to the reference sequence. 
25 Thus, a first aspect of the present invention describes a fusion protein 

comprising (a) an MCH-R polypeptide region and (b) a fluorescent polypeptide 

region. The fluorescent polypeptide region is joined directly, or though a polypeptide 

linker, to the carboxy side of the MCH-R polypeptide region. 

Another aspect of the present invention describes an MCH-R chimeric 
30 protein. The protein comprises: (a) an MCH-R binding region characteristic of a 

human MCH-R, (b) a transmembrane domain characteristic of a nonrhuman MCH-R, 

and (c) an intracellular domain characteristic of a non-human MCH-R. 

Another aspect of the present invention describes a nucleic acid 

encoding for an MCH-R fusion protein or an MCH-R chimeric protein described 
35 herein. Such nucleic acid comprises either a contiguous nucleotide sequence that 
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codes for the protein or a sequence that is processed by a host cell to produce a 
contiguous nucleotide sequence encoding for the protein. Processing of a nucleic acid 
sequence to produce a contiguous nucleotide sequence encoding for a protein can 
occur by the splicing together of exons resulting in intron removal. 
5 Another aspect of the present invention describes an expression vector 

comprising a nucleic acid encoding for an MCH-R fusion protein or an MCH-R 
chimeric protein described herein. 

Another aspect of the present invention describes a recombinant cell 
comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R 

10 chimeric protein described herein. The nucleic acid may be part of the host genome 
or may exist independently of the host genome. 

Another aspect of the present invention describes a non-human 
transgenic animal comprising nucleic acid encoding for an MCH-R fusion protein or 
an MCH-R chimeric protein described herein. 

15 Another aspect of the present invention describes a method for 

assaying for MCH-R active compounds by measuring the effect of a test preparation 
on one or more MCH-R activities. The method is performed using either an MCH-R 
fusion protein or an MCH-R chimeric protein described herein. 

Other features and advantages of the present invention are apparent 

20 from the additional descriptions provided herein including the different examples. 
The provided examples illustrate different components and methodology useful in 
practicing the present invention. The examples do not limit the claimed invention. . 
Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates aequorin assay results comparing a mouse MCH-R 
fusion with a human wild type MCH-R and a CMV-EGFP control. 

Figure 2 illustrates a cAMP flashplate assay of CHO cell clones stably 

30 expressing mMCH-lR-EGFP. Cells from individual clones were dissociated in 
enzyme free media and stimulated for 15 minutes at 37°C with human MCH at the 
indicated concentrations in the presence of 10 \iM forskolin. Cells were then lysed 
and assayed for bound [ I25 I]cAMP. Mouse MCH-1R-EGFP clones exhibited EC50 
values (0.1111, 0.1255, 0.1291, or 0.2304 nM) indistinguishable from that of a CHO 

35 cell clone expressing the wild-type human short isofonn of MCH-1R ( 0.1282 nM). 
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Figure 3 illustrates a cAMP flashplate assay of CHO cell clones stably 
expressing human short/mouse species chimeric MCH-1R-EGFP. Cells from 
individual clones were dissociated in enzyme free media and stimulated for 15 
minutes at 37°C with human MCH at the indicated concentrations in the presence of 
5 10 fiM forskolin. Cells were then lysed and assayed for bound [ I2:> I]cAMP. Human 
short/mouse species chimeric MCH-1R-EGFP clones exhibited EC50 values (0.0366, 
0.0462, 0.2117, or 0.2499 nM) indistinguishable from that of a CHO cell clone 
expressing the wild-type human short isoform of MCH-1R (0.1137 nM). 

10 DETAILED DESCRIPTION OF THE INVENTION 

The present invention features MCH-R chimeric and fusion proteins. 
Such proteins have a variety of different uses including being used as a research tool 
to study MCH-R function and dynamics, and being used to screen for MCH-R 
agonists and antagonists. 

15 The MCH-R provides a target to achieve different beneficial effects in 

a patient. Preferably, MCH-R activity is modulated to achieve one or more of the . 
following: weight loss, weight gain, treat cancer (e.g., colon or breast), reduce pain, 
treat diabetes, reduce stress, or teat sexual dysfunction. 

Modulation of MCH-R activity can be achieved by evoking a response 

20 at the MCH receptor or by altering a response evoked by an MCH receptor agonist or 
antagonist. Compounds modulating MCH-R receptor activity include agonists, 
antagonists, and allosteric modulators. Generally, MCH-R antagonists and allosteric 
modulators negatively affecting activity will be used to achieve weight loss, treat 
cancer (e.g., colon or breast), reduce pain, reduce stress, or teat sexual dysfunction; 

25 and MCH-R agonists and allosteric modulators positively affecting activity will be 
used to produce a weight gain. 

Preferably, MCH-R activity is modulated to achieve a weight loss or to 
treat diabetes in a patient. Diabetes mellitus can be treated by modulating MCH-R 
activity to achieve, for example, one or both of the following: enhancing glucose 

30 tolerance or decreasing insulin resistance. 

Excessive body weight is a contributing factor to different diseases, 
including hypertension, diabetes, dyslipidemias, cardiovascular disease, gall stones, 
osteoarthritis, and certain forms of cancers. Bringing about a weight loss can be used, 
for example, to reduce the likelihood of such diseases and as part of a treatment for 

35 such diseases. Weight reduction can be achieved by modulating MCH-R activity to 
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obtain, for example, one or more of the following effects: reducing appetite, 
increasing metabolic rate, reducing fat intake, or reducing carbohydrate craving. 

Increasing body weight is particularly useful for a patient having a 
disease or disorder, or under going a treatment, accompanied by weight loss. 
5 Examples of diseases or disorders accompanied by weight loss include anorexia, 
AIDS, wasting, cachexia, and frail elderly. Examples of treatments accompanied by 
weight loss include chemotherapy and radiation therapy. 

MCH-R Chimeric Proteins 

10 MCH-R chimeric proteins contain an MCH-R polypeptide region made 

up by at least two or more polypeptide regions characteristic of MCH-R found in 
different species. The different polypeptide regions that are present provide for an N- 
terminal extracellular domain; a transmembrane domain made up of transmembrane 
regions, extracellular loop regions, and intracellular loop regions; and an intracellular 

15 carboxy terminus domain. Examples of MCH-R amino acid sequences include the 
following: SEQ. ID, NO. 1 (human MCH1R long form), SEQ. ID. NO. 2 (human . 
MCH1R short form), and SEQ. ID. NO. 3 (mouse MCH1R). 

Preferably, the MCH-R chimeric protein comprises an MCH-R binding 
region characteristic of a human MCH-R along with transmembrane and intracellular 

20 domains characteristic of a non-human MCH-R. There are substantial amino acid 

differences between the N-terminus of the MCH-R present in humans and that present 
in other species such as mice. Such differences could result in, for example, the 
mouse MCH-R having different intrinsic properties and responsiveness to agonists 
and/or antagonists than the human MCH-R. The presence of a human MCH-R 

25 binding region provides for a "humanized" MCH-R chimeric receptor. 

The transmembrane and intracellular domains characteristic of a non- 
human MCH-R can be used in conjunction with a non-human host to provide a more 
naturally occurring environment for these regions. For example, an MCH-R chimeric 
having mouse transmembrane and intracellular domains are preferably used in murine 

30 cells lines or in transgenic mice. 

MCH-R chimeric proteins may contain regions other than extracellular, 
transmembrane, and intracellular domains that do not substantially decrease the 
activity of the protein. Preferably, additional regions do not cause a decrease of more 
than about 25% of MCH-R activity as measured using one or more of the assays 
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described in the examples provided below. Examples of additional regions that may 
be present include fluorescent protein regions and linker regions. 

In an embodiment of the present invention, the MCH-R chimeric 
protein comprises: (a) an MCH binding region characteristic of a first species and (b) 
5 a transmembrane and intracellular domain region characteristic of a second species 
joined directly, or though a linker, to the carboxy side of the MCH binding region. 
Preferably, the protein comprises, consists, or consists essentially of an MCH-R 
polypeptide having a sequence similarity of at least about 75%, at least 85%, or at 
least 95% with either SEQ. ID. NO. 4 (human short form/mouse species chimeric 

10 MCH1R) or SEQ. ID. NO. 5 (human long form/mouse species chimeric). Even more 
preferably, the protein comprises, consists essentially of, or consists of, SEQ. ID. NO. 
4 or SEQ. ID. NO. 5. 

Sequence similarity for polypeptides can be determined by BLAST. 
(Altschul et al, 1997. Nucleic Acids Res. 25, 3389-3402, hereby incorporated by 

15 reference herein.) In an embodiment of the present invention, sequence similarity is 
determined using tBLASTn search program with the following parameters: 
MATRIX:BLOSUM62, PER RESIDUE GAP COST: 1 1 , and Lambda ratio: 1 . 

Differences in naturally occurring amino acids are due to different R 
groups. An R group effects different properties of the amino acid such as physical 

20 size, charge, and hydrophobicity. Amino acids can be divided into different groups as 
follows: neutral and hydrophobic (alanine, valine, leucine, isoleucine, proline, 
tryptophan, phenylalanine, and methionine); neutral and polar (glycine, serine, 
threonine, tyrosine, cysteine, asparagine, and glutamine); basic (lysine, arginine, and 
histidine); and acidic (aspartic acid and glutamic acid). 

25 Generally, in substituting different amino acids it is preferable to 

exchange amino acids having similar properties. Substituting different amino acids 
within a particular group, such as substituting valine for leucine, arginine for lysine, 
and asparagine for glutamine are good candidates for not causing a change in 
polypeptide functioning. 

30 Changes outside of different amino acids groups can also be made. 

Preferably, such changes are made taking into account the position of the amino acid 
to be substituted in the polypeptide. For example, arginine can substitute more freely 
for nonpolor amino acids in the interior of a polypeptide then glutamate because of its 
long aliphatic side chain. (See, Ausubel, Current Protocols in Molecular Biology, 

35 John Wiley, 1987-1998, Supplement 33 Appendix 1C.) 



-6- 



WO 01/68706 



PCT/US01/08071 



MCH-R Fusion Proteins 

MCH-R fusion proteins contain an MCH-R polypeptide region and a 
fluorescent protein region either directly joined together or joined together through a 
5 linker. These regions provide MCH-R activity and a marker for evaluating MCH-R 
dynamics. 

An MCH-R polypeptide region provides functional MCH-R activity 
and includes naturally occurring MCH-R, chimeric MCH-R, and derivatives thereof. 
Preferred derivatives thereof have a sequence similarity of at least about 75%, at least 

10 about 85%, or at least about 95% to a naturally occurring MCH-R or a chimeric 
MCH-R described herein. 

A fluorescent protein region contains a chromophore that fluoresces. 
Preferably, the fluorescent protein region is the green fluorescent protein of the 
jellyfish Aequorea victoria or a derivative thereof. Preferred derivatives have a 

15 sequence similarity of at least about 75%, at least about 85%, or at least about 95% to 
the Aequorea victoria green fluorescent protein (GFP). The Aequorea victoria green 
fluorescent protein and examples of derivatives thereof are described by Cormack et 
ai, 1996. Gene 17, 33-38; Yang et al t 1996. Nucleic Acids Research .24, 4592-4593; 
Tsien et ai, U.S. Patent No. 5,625,048; Tsien et al, U.S. Patent No. 5,777,079; and 

20 Cormack et al, U.S. Patent No. 5,804,387 (each of which are hereby incorporated by 
reference herein). 

In different embodiments the MCH-R polypeptide region comprises, 
consists essentially of, or consists of, a sequence selected from the group consisting 
of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. 

25 NO. 5; and the fluorescent polypeptide region comprises, consists essentially of, or 
consists of, an amino acid sequence selected from the group consisting of SEQ. ID. 
NO. 6 (GFP), SEQ. ID. NO. 7 (EGFP), SEQ. ID. NO. 8 (Emerald), SEQ. ID. NO. 9 
(Topaz), and SEQ. ID. NO. 10 (Wlb). EGFP, Emerald, Topaz, and Wlb are 
derivatives of GFP. 

30 The optionally present linker is a polypeptide region that is preferably 

from 1 to about 100 amino acids in length. In different embodiments the linker is up 
to 75, 50 or 25 amino acids in length. 

Preferably, the MCH-R fusion protein comprises, consists essentially 
of, or consists of, the MCH-R polypeptide region and the fluorescent polypeptide 

35 region. More preferably, the protein comprises, consists essentially of, or consists of, 
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an amino acid sequence selected from the group consisting of: SEQ. ID. NO. 11 
(mouse MCHlR-linker-EGFP), SEQ. ID. NO. 12 (mouse MCH1R/EGFP direct 
fusion), SEQ. ID. NO. 13 (human short form/mouse species chimeric MCHlR-linker- 
EGFP), or SEQ. ID. NO. 14 (human long form/mouse species chimeric MCH1R- 
5 linker-EGFP). 

MCH-R Chimeric and Fusion Proteins Nucleic Acid and Expression 

MCH-R chimeric and fusion proteins can be produced using 

techniques well known in the ait. Preferably, such proteins are produced by 
10 recombinant expression inside a host cell by way of an expression vector or by way of 

nucleic acid integrated into the host genome. Examples of nucleic acid sequences 

encoding for MCH-R polypeptide regions, fluorescent protein regions, MCH-R 

chimeric proteins, and MCH-R fusion proteins are provided for by SEQ. ID. NOs. 15- 

29 (see Example 1, infra), 
15 Starting with a particular amino acid sequence and the known 

degeneracy of the genetic code, a large number of different encoding nucleic acid . 

sequences can be obtained. The degeneracy of the genetic code arises because almost 

all amino acids are encoded for by different combinations of nucleotide triplets or 

codons. The translation of a particular codon into a particular amino acid is well 
20 known in the art (see, e.g., Lewin GENES IV, p. 119, Oxford University Press, 1990). 

Amino acids are encoded for by codons as follows: 

A=Ala= Alanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 

D=Asp=Aspartic acid: codons GAC, GAU 
25 E=Glu=Glutamic acid: codons GAA, GAG 

F=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His=Histidine: codons CAC, CAU 

I=Ile=Isoleucine: codons AUA, AUC, AUU 
30 K=Lys=Lysine: codons AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 

M=Met=Methionine: codon AUG 

N=Asn=Asparagine: codons AAC, AAU 

P=Pro=Proline: codons CCA, CCC, CCG, CCU 
35 Q=Gln=Glutamine: codons CAA, CAG 
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R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 

S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 

T=Thr=Threonine: codons ACA, ACC, ACG, ACU 

V=Val=Valine: codons GUA, GUC, GUG, GUU 
5 W=Trp=Tryptophan: codon UGG 

Y=Tyr=Tyrosine: codons UAC, UAU 

Examples of techniques for introducing nucleic acid into a cell and 

expressing the nucleic acid to produce protein are provided in references such as 

AusubeL Current Protocols in Molecular Biology, John Wiley, 1987-1998, and 
10 Sambrook, et al y in Molecular Cloning, A Laboratory Manual, 2 nd Edition, Cold 

Spring Harbor Laboratory Press, 1989. 

An expression vector contains recombinant nucleic acid encoding for a 

polypeptide along with regulatory elements for proper transcription and processing. 

The recombinant nucleic acid contains two or more nucleic acid regions not naturally 
15 associated with each other. Exogenous regulatory elements such as an exogenous 

promoter can be useful for expressing recombinant nucleic acid in a particular host. 

Examples of expression vectors are cloning vectors, modified cloning vectors, 

specifically designed plasmids, and viruses. 

Generally, the regulatory elements that are present in an expression 
20 vector include a transcriptional promoter, a ribosome binding site, a terminator, and 

an optionally present operator. Another preferred element is a polyadenylation signal 

providing for processing in eukaryotic cells. Preferably, an expression vector also . 

contains an origin of replication for autonomous replication in a host cell, a selectable 

marker, a limited number of useful restriction enzyme sites, and a potential for high 
25 copy number. 

Expression vectors providing suitable levels of polypeptide expression 
in different hosts are well known in the art. Mammalian expression vectors well 
known in the art include pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl 
(Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV- 1(8-2) 
30 (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), 
pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), pCI- 
neo (Promega) and .lambda.ZD35 (ATCC 37565). Bacterial expression vectors well 
known in the art include pETl la (Novagen), lambda gtll (Invitrogen), pcDNAII 
(Invitrogen), and pKK223-3 (Pharmacia). Fungal cell expression vectors well known 
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in the art include pYES2 (Invitrogen) and Pichia expression vector (Invitrogen). 

Insect cell expression vectors well known in the art include Blue Bac HI (Invitrogen). 

Recombinant host cells may be prokaryotic or eukaryotic. Examples 

of recombinant host cells include the following: bacteria such as E. coli: fungal cells 
5 such as yeast; mammalian cells such as human, bovine, porcine, monkey, hampster, 

and rodent; and insect cells such as Drosophila and silkworm derived cell lines. 

Commercially available mammalian cell lines include L cells L-M(TK.sup.-) (ATCC 

CCL 1.3), L cells L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 

86), CV-1 (ATCC CCL 70),COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), 
10 CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), 

HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and 

MRC-5 (ATCC CCL 171). 

To enhance expression in a particular host it may be useful to modify 

the sequence to take into account codon usage of the host. Codon usage of different 
15 organisms are well known in the art. (See, Ausubel, Current Protocols in Molecular 

Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) 

Expression vectors may be introduced into host cells using standard 

techniques. Examples of such techniques include transformation, transfection, 

lipofection, protoplast fusion, and electroporation. 
20 Nucleic acid encoding for a polypeptide can be expressed in a cell 

without the use of an expression vector employing, for example, synthetic mRNA or 

native mRNA. Additionally, mRNA can be translated in various cell-free systems . 

such as wheat germ extracts and reticulocyte extracts, as well as in cell based systems, 

such as frog oocytes. Introduction of mRNA into cell based systems can be achieved, 
25 for example, by microinjection. 

Techniques for producing transgenic animals are well known in the art. 

Examples of such techniques are provided for by Teratocarcinomas and embryonic 

stem cells: a practical approach. Ed. By E. J. Robertson, IRL Press Limited, Oxford, 

England (1987); and Gene Targeting: a practical approach. Ed. By A. L. Joyner, 
30 Oxford University Press Inc. New York, NY (1993). 

G-Protein Coupled Receptor Assays 

MCH-R is G-protein coupled receptor. Techniques for measuring 
different G-protein activities, such as Gi/o, Gs, and Gq are well known in the art. 
35 MCH-R activity is preferably assayed for by measuring either Gi/o or Gq. 
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Gi/o and Gs activity can be measured using techniques such as a 
melonaphore assay, measuring cAMP production, measuring inhibition of cAMP 
accumulation, and measuring binding of 35S-GTP. cAMP can be measured using 
different techniques such as radioimmunoassay and indirectly by cAMP responsive 
5 gene reporter proteins . 

Gq activity can be measured using techniques such as those measuring 
intracellular Ca 2+ . Examples of techniques well known in the art that can be 
employed to measure Ca 2+ include the use of dyes such as Fura-2 and the use of Ca 2+ - 
bioluminescent sensitive reporter proteins such as aequorin. An example of a cell line 
10 employing aequorin to measure G-protein activity is HEK293/aeql7. (Button et 
al, 1993. Cell Calcium 14, 663-671, and Feighner et al, 1999. Science 284, 2184- 
21SS, both of which are hereby incorporated by reference herein.) 

Functional assays can be performed using individual compounds or 
preparations containing different compounds. A preparation containing different 
15 compounds where one or more compounds affect MCH-R chimeric or fusion protein 
activity can be divided into smaller groups of compounds to identify the compound(s) 
affecting MCH-R chimeric or fusion protein activity. In an embodiment of the 
present invention a test preparation containing at least 10 compounds is used in a 
functional assay. 

20 Functional assays can be performed using recombinantly produced 

MCH-R chimeric or fusion protein present in different environments. Such 
environments include, for example, cell extracts and purified cell extracts containing 
the MCH-R chimeric or fusion protein expressed from recombinant nucleic acid and 
an appropriate membrane for the polypeptide; and the use of a purified MCH-R 

25 chimeric or fusion protein produced by recombinant means that is introduced into a 
different environment suitable for measuring G-protein activity. 

Fluorescent Protein Assays 

Fluorescent protein joined to an MCH receptor can be employed to 
30 study different aspects of receptor dynamics including receptor sequestration, receptor 
densitization, and receptor localization. The fluorescent protein can .be used in in 
vitro or in vivo systems. 

In vitro applications of fluorescent proteins can be performed using 
techniques well known in the art. Examples of such techniques are provided by Barak 
35 et al, 1997. Mol Pharnu 5, 177-184; Tarasova et al, 1997. J. Biol Chem. 272, 
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14817-14824; Lin ex al, 1998. Mol Cell Endo. 146, 27-37; Tarasova ex al, 1998. J. 
Biol Chem. 273, 15883-15886; Kallal etal, 1998. J. Biol. Chan. 273, 322-328; 
Groake ex al, 1999. J. Biol Chem. 274, 23263-23269; Doherty ex al, 1999. 
Biochem. J. 341, 415-422; Brock ex al, 1999. Proc. Natl Acad. Sci. USA 96, 10123- 
5 10128; Cornea et al, 1999. Endocrinology 140, 4272-4280; and Lembo ex al, 1999. 
Nax. Cell Biol 1, 267-271 (these references are not admitted to be prior art to the 
claimed invention). 

In vivo applications of fluorescent proteins can be performed using 
techniques well known in the art. Examples of such techniques are provided by 
10 Mombaerts ex al, 1996. Cell 87, 675-686; Rodriquez ex al, 1999. Cell 97. 199-208; 
Spergel et al, 1999. J. Neurosci. I 2037-2050; and Zuo ex al, 1999. Proc. Natl. 
Acad. Set USA 96, 14100-14105 (these references are not admitted to be prior art to 
the claimed invention). 



15 EXAMPIJES 

Examples are provided below to further illustrate different features and 
advantages of the present invention. The examples also illustrate useful methodology 
for practicing the invention. These examples do not limit the claimed invention. 

20 Example 1: 

Amino acid and nucleic acid sequence information for SEQ. ID. NOs. 
1-29 are provided below. SEQ. ID. NOs. 1-29 include examples of polypeptide and 
encoding nucleic acid sequences for MCH-R polypeptide regions, fluorescent 
polypeptide regions, fusion proteins and chimeric proteins. In some cases the 
25 encoding nucleic acid is shown with additional nucleic acid upsteam or downstream 
from an open reading frame. 



SEQ. ID. NO. 1: Human long form MCH1R 

MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP 
30 AWVEGSSARLWEQATGTGWK'IDLEASLLPTGPNASNTSDGPDNLTSAGSPPR 
TGSIS\TMIMPSVFGTICL^^ 

LJJFLLGMPFMIHQ1JVIGNG VWHFGETM QFTST YILT AMAIDR 

YLATVHPISSTKFRKPSVATLW^ 

RIJNPDTDLYWFI^^ 
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TKRVTRTAIAICLVFFVCWAPYYVLQLTQI^ISRPTLTFV'i'LYNAAISLGYANS 
CLNPFVYIVLCETFRKRLVLSVKPAAQGQLRAVSNAQTADEERTESKGT 

SEQ. ID. NO. 2: Human short form MCH1R 
5 MDLEASIiPTGPNASNTSDGPDNLTSAGSPPRTGSIS\TNIIMPSVFGTICLLGnG 
NST\^AVVKKSKLiIWCNNWDIFIE^^ 

FGETMCTUTAMDANSQFTST\TLTAMATORYLATVHPISSTKFRKPSVATL\a 
GIXWALSHS1TPVWLYARUPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFA 
LPFWITAAYVRILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPY 
10 YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYTVLCETFRKRLVLSV 
KPAAQGQLRAVSNAQTADEERTESKGT 

SEQ. ID. NO. 3: Mouse MCH1R 

^LQASIiSTGPNASMSDGQDiNFTLAGPPPRTRSVS\TNinviPSVFGTICLLGI 
1 5 VGNSTVEFA VVKKSKLHWCSNVPDIFIINLS VVDLLFLLGMPFMIHQLMGNGV 
WHFGETMCTLITAMDANSQFTSTmTAMAroRYLATVFIPISSTia^ICPSMAT 
LVICIXWALSHSITPVWEYARIJPFPGGAVGCGIRIJ'NPDTDLYWFTLYQFFLA 
FAIJFVV1TAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA 
PYYVLQLTQI^ISRPTLTFVYLYNAMSLGYANSCU^FVYTVLCETFRKRLVLS 
20 VKPAAQGQLRTVSNAQTADEERTESKGT 

SEQ. ID. NO. 4: Human short form/mouse species chimeric MCH1R 
MDI^ASLIJ , TGPNASNTSDGPD^^LTSAGSPPRTGSIS\^^^IMPSVFGTICLLG^G 
NSTVIFAV VKKSKLJIWCNNYPDMINLS VVDL1PIXGMPFMIHQLMGNG VWH 
25 FGETMCTLITAMDANSQFTSTYILTAMAJDRYLATVHPISSTKFRKPSMATLVI 
CIXWALSHSITPVWXYARLIPFPGGAVGCGIRLPNPDTDLYVvTTLYQFFLAFA 
LPFVWAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPY 
YVLQLTQLSISPJPTLTFVYLYNAAISLGYANSCIJ^FVYIVLCETFRKRLVLSV 
KPAAQGQLRTVSNAQTADEERTESKGT 

30 

SEQ. ID. NO. 5: Human long form/mouse species chimeric MCH1R 

MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP 
AWATEGSSARLWEQATGTGWNIDLEASLLPTGPNASNTSDGPDNLTSAGSPPR 
TGSISYINIIMPSWGTICIXGnGNSTVIFAVVKKSKLIWC^^DimNLSV^ 
35 IXJ^GMPFMIHQLMGNGVWHFGETMCTIJrr AMD AN S QFTS T YILT AMAIDR 
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YLATVHPISSTKFRKPSMATLVICLLWALSHSITPVWLYARLIPFPGGAVGCGI 
RLPNTDTDLYWFTLYQFFLAFALPFVmAAYVKILQR^SSVAPASQRSIRLR 
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTF\'YLYNAAISLGYANS 
CLNPFVYWLCETFRKRL\T-SVKPAAQGQLRTVSNAQTADEERTESKGT 

5 

SEQ. ID. NO. 6: GFP 

MSKGEEIJTGVWILVELDGDVNGHKFSVSGEGEGDATYGKLTLKnCTTGKL 
PWWPTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGN 
YKTRAEVKFEGDTLVNPJELKGIDFKEDGMLGHKLEYNYNSHNVYIMADKQ 
1 0 KNGKVNFBORHNIEDGS VQLADHYQQNTPIGDGPVLLPDNHYLSTQS ALSKD 
PNEKRDHMVT ,T .FFVTAAGITHGMDELYK 

SEQ. ID. NO. 7: EGFP 

NTVSKGEEIJTGVWILVEIJDGDVNGHKFSVSGEGEGDATYGia.TLKHCTTGK 
15 LP VPWPTLVTTLTYGVQCFS RYPDHMKQHDFFKS AMPEG YVQERTIFFKDDG 
NYKTRAEVKFEGDTLVNRJELKGroFKEDGNE.GHKIJEYNYNSHNV 
QKNGKVMTOPJIN1EDGSVQLADHYQQ 
DPN^KRDHMVLLEFVTAAGITLGMDELYK 

20 SEQ. ID. NO. 8 : Emerald 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

25 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr He Thr Ala Asp Lys Gin Lys 
Asn Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

30 Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
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SEQ. ED. NO. 9: Topaz 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe De Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
5 Leu Val Thr Thr Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Arg 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly De Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr De Met Ala Asp Lys Gin Lys 
10 Asn Gly De Lys Val Asn Phe Lys De Arg His Asn De Glu Asp Gly Ser Val Gin Leu 
Ala Asp His Tyr Gin Gin Asn Thr Pro De Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Tyr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly De Thr Leu Gly Met Asp Glu Leu Tyr Lys 

15 SEQ. ID. NO, 10: W1B 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro De Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu. Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe De Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
Leu Val Thr Thr Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

20 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr De Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg De Glu Leu Lys Gly De Asp Phe Lys Glu Asp Gly Asn De Leu Gly 
His Lys Leu Glu Tyr Asn Tyr De Ser His Asn Val Tyr De Thr Ala Asp Lys Gin Lys 
Asn Gly De Lys Ala His Phe Lys De Arg His Asn De Glu Asp Gly Ser Val Gin Leu 

25 Ala Asp His Tyr Gin Gin Asn Thr Pro De Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly De Thr Leu Gly Met Asp Glu Leu Tyr Lys 

SEQ. ID. NO. 11: Mouse MCHlR-linker-EGFF 

30 MDLQASLI^TGPNASMSDGQDNFILAGPPPR^ 
VGNSTVIFAVVKKSKIJFIWCS^ 

WHFGETMCTLTT AMD ANS QFTST YKTAMAIDR YL AT VHPIS STKFRKPSMAT 
LVICIXWALSHSriPVWLYARL 
FALPFVVITAAYVTnLQRMTSSVAPASQ 
35 PYYVLQLTQLSISRPTLTFVYLWAAI 



-15- 



WO 01/68706 



PCT/US01/08071 



VKPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATM\'SKGEELFTGV 
VHLVELDGDVNGHKFSVSGEGEGDATYGKLTL™ 

TyGVQCFSRYPDHMXQHDFFKSAMPEGYVQERTIFFKDDGNYKTR.AE^^KFE 
GDTLVNPJELKGmFKFiDGMLGHKLEYNYNSHNVY^ 
5 HNTEDGSVQLADHYQQNTPIGDGPVUJPDNH^ 
LEFVTAAGITLGMDELYK 

SEQ. ID. NO. 12: Mouse MCH1R/EGFP direct fusion 

MDLQAS1XSTGPNASMSDGQDNFILAGPPPRTRSVSYINIIMPSVFGTICLLGI 

10 VGNSTVIFAVVKKSKIJIWCSNVPDIFn^SVVDlJJ^ 

WHFGETMCTLXTAN©ANSQFTSTYILTAMAIDRYLATVHPIS STKFRKPSMAT 
LVICLI.WALSHSITPVWLYAPJ.IPFPGGAVGCGIPJ.PNPDTDLY\\TTLYQFFLA 
FALPFVWAAYVKILQmTSSVAPASQRSIRmTKRVTRTAIAICLVFFVCWA 
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNTFVYRT^CETFIIIGILVLS 

15 VKPAAQGQLRTVSNAQTADEERTESKGTMVSKGEELFTGVVPE.\^ELDGDVN 
GHKFSVSGEGEGDATYGKLTLKHCTTGKLPWWPTLVTTLTYGVQCFSRYPD 
HMKQHDFFKSAMPEGYVQERTIFFKDDGN\TvTRAEVKFEGDTLVMimLKGI 
DFKEDGNILGHKLEYNYNSHNVITMAD^ VQLAD 
HnfQQNTPIGDGPVLLPDNHYL^TQSALSKDPN^KPJDHM\TJ^FVTAAGITLG 

20 MDELYK 

SEQ. ID. NO. 13: Human short form/mouse species chimeric MCHlR-linker- 
EGFP 

MDLiASIiPTGPNASNTSDGPDNLTSAGSPPRTGSISYINIIMPSVFGTICLLGIIG 

25 NSTWAWKKSKLHWCMsWPDIFTINL^V^ 

FGETMCTUTAMD ANSQFTST YILTAMAIDRYLATVHPIS STKFRKPSMATLVI 
CLLWALSHSriPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWrLYQFFLAFA 
LPFVVTTAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPY 
YVLQLTQLSISRPTLTFVYLl^AISLGYANSCI^FVYR/LCETFRKRLVLSV 

30 KPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGVV 
P^'EIJDGDVNGHKFSVSGEGEGDATYGKLllJCHCTTGKLPypWPTLVTTLT 
YGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYICTRAEVKFEG 
DTLVNT^LKGIDFKEDGNILGHK^ 

NIEDGSVQLADHYQQNTPIGDGPVIXPDNHYLSTQSALSKDPNEKRDHMVLL 
35 EFVTAAGITLGMDELYK 
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SEQ. ED. NO. 14: Human long form/mouse species chimeric MCHlR-linker- 
EGFP 

MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP 
5 AWVEGSSARLWEQATGTGWMDIEASLLPTGPNASNTSDGPDNLTSAGSPPR 
TGSISYINIIMPSWGTIClI.GnGNST\aFAVVKKSKLiIWCN^ 
IJJ^GMPFMmQLMGNGVWHFGEmCTIJTAMDANSQFrSTmTAMAIDR 
YLATVHPISSTKFRKPSA'IATLVICLLWALSHSITP\^VLYARLffFPGGAVGCGI 
RI^NPDTDLYWFTLYQFPLAFALPFVVTrA AYVKILQRMTS S VAPASQRSIPJLR 

1 0 TBOIVTRTAIAICLVFPVCWAPYYVLQLTQLSISRPTLTFA^YLYNAAISLG YANS 
CI^FAnOVLCETFRKRLVLSVKPAAQGQLRTVSNAQTADEERTESKGTVDGT 
AGPGSIATMVSKGEET.FTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLT 
LmCTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQE 
RTIFFXDDGNYKTRAEVKFEGDTLVNRIEIJKGIDFKEDGNn^ 

15 m^\TMADKQKNGIKVNFKIRHNIEDGSVQ]^HYQQKTTIGDGP\aXPDNHY 
LSTQSAiSKDPNEKRDHMVlXEFVTAAGITLGMDELYK 

SEQ. ID. NO. 15. Human long form MCH1R cDNA 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 

20 GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT 

25 CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA 

30 TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC 
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC 
ATCTCTTCCACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGC 
CTCCTGTGGGCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCC 
AGACTCATCCCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCC 

35 CAACCCAGACACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGC 
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CTTTGCCCTGCCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCA 
GCGCATGACGTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGC 
GGACAAAGAGGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTT 
GTGTGCTGGGCACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGC 
5 CGCCCGACCCTCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGC 
TATGCCAACAGCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACG 
TTCCGCAAACGCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCT 
TCGCGCTGTCAGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGC 
AAAGGCACCTGA 

SEQ. ID. NO. 16: Human short form MCH1R cDNA 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 

1 5 GCCTCCTGGGC ATCATCGGGA ACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA 
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 

20 ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGCCTCCTGTGG 
GCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCCAGACTCATC 
CCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCCCAACCCAGA 
CACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGCCTTTGCCCTG 

25 CCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCAGCGCATGAC 
GTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGCGGACAAAGA 
GGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG 
CACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGCCGCCCGACCC 
TCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGCTATGCCAACA 

30 GCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACGTTCCGCAAAC 
GCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCTTCGCGCTGTC 
AGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCT 
GA 
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SEQ. ED. NO. 17: Mouse MCH1R cDNA 

Nucleic acid sequence start and stop codons are highlighted: 

GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA 
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG 
5 CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA 
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA 
GGATAATTTCACATTGGCGGGGCCACCTCCTCGCACAAGGAGTGTCTCCT 
ACATCAACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCA 
TTGTGGGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTG 

10 CACTGGTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTG 
GATCTGCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGT 
AATGGTGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCAT 
GGACGCCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCA 
TTGACCGCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGA 

1 5 AGCCCTCCATGGCC ACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCA 
TTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGG 
GTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACT 
GGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCAT 
CACTGCTGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCC 

20 CAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACA 
GCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTG 
CTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTAC 
CTGTACAATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCC 
CTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTC 

25 GGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGA 
CAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCTGACAATCCCCCCC 
GGTCACCTCCAAGTCAGGTCACCGCATCAAACCATGGGGAGAGATACTGA 
GATAAACCCGGGGCTACCCTGGGAGGATGCAGAAGCTGGAGGCTGGGGG 
CTTGTAGCAAACCACATTCCACGGGGCCCACAAATTGCTAGGGAGGCTTG 

30 CAGCCTGGTTTGGGGGGGAAGCCTCAGACTGCAGGGATCCCCTTGACAGA 
ATAGAAGCGGAGCAAGAAGGAAAGGGTGGTTTGACTGGTTCTCGGGGTCT 
GTATCTGTTGGCTCGCATATATCTTTCTCTCAAGGGAAGAAGGCGGAGGT 
GCCTAGCTGGGTTCCTTTAAAACTAGGCAGGGCTAGGATCTGAGCAGCTA 
GGGCTCTACTGTGAGACTGGGCAAGCCGAGCGTTCCCTCCCATCTCTCATT 

35 GGTGTTGATAGAAGGCAGTCTTTCTCCCAAGCTGGTGGATCTCCTGAAGC 
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ACGCTGCCTGGGCTCCAGCATCCTGTGCGGATTTCACGTTCTCTTTAGGGG 
ATGCATGTTGACACTGGGGTGTGGGCTCTGAGCCCACAGGAGTTTAAAAA 
ACCAAAAGAGCTCAGAGTGTCGAGAGAGACCCAATCACCGAGAATGACA 
AGGCAACCTGGGGTGGATGTGGATCTTGAAACTAATAAAAAGGGGTTTTC 
5 ACAGTGACAGCGACATTCTCTTCATAGGGCACAGCTGTCAGTCTATGGCT 
GATCCAGAGCGAGCATCCATGAATTCTGCATGTGCAGGGGTCACTCTAAT 
ACCTGATATGTTGGCATCATCTTTGTGCTTGAGCCTTCCNCTCCCAAATGG 
GAATGAAATAAAGGCAAATTCCCNCCCCCCCCAAAAAAGGGGNAAAAAA 
AAAAAAAAAAAAAAAAAAAA 

10 

SEQ. ID. NO. 18: Mouse MCH1R genomic DNA 

Nucleic acid sequence start and stop codons, as well as intron borders, are 
highlighted: 

GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA 

15 AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG 
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA 
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA 
GGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTCCTCCCTCCTCCG 
GGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAGCCAAACTGCCTG 

20 GGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCAGTCGGCTTTGCC 
TGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGACAGAAGGGGCA 
GGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAACAACAAGGGTTG 
GTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGCTGATGAGTCCCA 
AATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGGCTGGAGAAGCAA 

25 GCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGTGGGAGCCAGCAG 
AAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTCTGGCCTCGCACTG 
GGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGCAAAGGCTGCTGG 
GAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTTTCTGGTGAATTA 
AGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATTCTAGCTCTGAAG 

30 GAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGTGGAGAAATTCCC 
AAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCTATTGGGAAAGCC 
CAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACAGAAAACCGGGAG 
AGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACCCATGGTGACAGTC 
ACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTTGGCCATCTCTCTC 

35 TGAAAAGATGAGGCTGCAAATACACACACACACACACACACACACACAC 
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ACACACACACACACACACACACACACACACACACAAATGTCCTTCAAGCC 
TTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGAAGTTGTTCTCAGC 
AGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAAATGGATGAAGCG 
AAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTAGAAAATCCTATAG 
5 GAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACAGAACTAACAATAG 
CCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAGACGTAGTGACCCC 
AAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAGGCCGACATGGCAG 
GTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTGCTCTTTCTATCCTC 
AGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATCAACATCATCATGC 

1 0 CTTC AGTGTTTGGTACCATCTGTCTCCTGGGC ATTGTGGG AAACTCC ACAG 
TCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTGGTGCAGCAACGTC 
CCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCTGCTTTTCCTGCTGG 
GCATGCCTTTCATGATCCACCAGCTCATGGGTAATGGTGTCTGGCACTTTG 
GGGAAACCATGTGCACCCTCATCACAGCCATGGACGCCAACAGTCAGTTC 

1 5 ACCAGCACCTACATCCTGACTGCTATGGCCATTGACCGCTACTTGGCCACC 
GTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCTCCATGGCCACCCTG 
GTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGG 
CTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATC 
CGCCTACCAAACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTT 

20 TTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAA 
ATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCAT 
CCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGG 
TCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGT 
CCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCA 

25 GCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCT 
GTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAG 
GGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGA 
CAGAAAGCAAAGGCACCTGACAATCCCCCCCGGTCACCTCCAAGTCAGGT 
CACCGCATCAAACCATGGGGAGAGATACTGAGATAAACCCGGGGCTACC 

30 CTGGGAGGATGCAGAAGCTGGAGGCTGGGGGCTTGTAGCAAACCACATTC 
CACGGGGCCCACAAATTGCTAGGGAGGCTTGCAGCCTGGTTTGGGGGGGA 
AGCCTCAGACTGCAGGGATCCCCTTGACAGAATAGAAGCGGAGCAAGAA 
GGAAAGGGTGGTTTGACTGGTTCTCGGGGTCTGTATCTGTTGGCTCGCATA 
TATCTTTCTCTCAAGGGAAGAAGGCGGAGGTGCCTAGCTGGGTTCCTTTA 

35 AAACTAGGCAGGGCTAGGATCTGAGCAGCTAGGGCTCTACTGTGAGACTG 
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GGCAAGCCGAGCGTTCCCTCCCATCTCTCATTGGTGTTGATAGAAGGCAG 
TCTTTCTCCCAAGCTGGTGGATCTCCTGAAGCACGCTGCCTGGGCTCCAGC 
ATCCTGTGCGGATTTCACGTTCTCTTTAGGGGATGCATGTTGACACTGGGG 
TGTGGGCTCTGAGCCCACAGGAGTTTAAAAAACCAAAAGAGCTCAGAGTG 
5 TCGAGAGAGACCCAATCACCGAGAATGACAAGGCAACCTGGGGTGGATG 
TGGATCTTGAAACTAATAAAAAGGGGTTTTCACAGTGACAGCGACATTCT 
CTTCATAGGGCACAGCTGTCAGTCTATGGCTGATCCAGAGCGAGCATCCA 
TGAATTCTGCATGTGCAGGGGTCACTCTAATACCTGATATGTTGGCATCAT 
CTTTGTGCTTGAGCCTTCCNCTCCCAAATGGGAATGAAATAAAGGCAAAT 
10 TCCCNCCCCCCCCAAAAAAGGGGNAAAAAAAAAAAAAAAAAAAAAAAA 
AA 

SEQ. ID. NO. 19: Human short form/mouse species chimeric MCH1R 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 

15 CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA 

20 CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 
ACCGCCATGCJCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG 
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT 

25 CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG 
ATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT 
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC 
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA 
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG 

30 CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC 
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC 
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA 
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT 
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC 

35 TGA 
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SEQ. ID. NO. 20: Human long form/mouse species chimeric MCH1R 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
5 GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 

ccagcaacacctctgatggccccgataacctcacttcggcaggatcacct 
cctcgcacggggagcatctcctacatcaacatcatcatgccttcggtgttc 

1 0 ggc acc atctgcctcctgggcatcatcgggaactccacggtc atcttcgcg 
gtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatctt 
catcatcaacctctcggtagtagatctcctctttctcctgggcatgccctt 
catgatccaccagctcatgggcaatggggtgtggcactttggggagacca 
tgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc 

15 tacatcctgaccgccatggccattgaccgctacctggccactgtccacccc 
atctcttccacgaagttccggaagccctccatggccaccctggtgatctgc 
ctcctgtgggctctctcgttcattagcatcactcctgtgtggctctatgcc 
aggcttatccccttcccagggggtgctgtgggctgtggcatccgcctacca 
aacccagatactgatctttactggttcactctgtatcagtttttcctggcct 

20 tcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagc 
gcatgacgtcttcggtggccccagcctctcaacgcagcatccggcttcgg 
acaaagagggtgacccgcacagccattgccatctgtctggtcttctttgtg 
tgctgggcgccctactacgtgctgcagctgacccagttgtccatcagccgc 
ccgaccctcacattcgtctacctgtacaatGcggccatcagcttgggctat 

25 gccaacagctgcctcaatccctttgtgtacatagtactctgtgagaccttt 
cgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttcg 
cacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaa 
ggcacctga 

30 SEQ. ID. NO. 21: Aequorea victoria Green Fluorescent Protein (GFP) cDNA 
Nucleic acid sequence start and stop codons are highlighted: 

TACACACGAATAAAAGATAACAAAGATGAGTAAAGGAGAAGAACTTTTC 
ACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGGCAC 
AAATTTTCTGTCAGTGGAGAGGGTGAAGGTGATGCAACATACGGAAAACT 
35 TACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCATGGCCAAC 
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ACTTGTCACTACTTTCTCTTATGGTGTTCAATGCTTTTCAAGATACCCAGAT 
CATATGAAACAGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGT 
ACAGGAAAGAACTATATTTTTCAAAGATGACGGGAACTACAAGACACGTG 
CTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATAGAATCGAGTTAAAA 
5 GGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAATTGGAATA 
CAACTATAACTCACACAATGTATACATCATGGCAGACAAACAAAAGAATG 
GAATCAAAGTTAACTTCAAAATTAGACACAACATTGAAGATGGAAGCGTT 
CAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGT 
CCTTTTACCAGACAACCATTACCTGTCCACACAATCTGCCCTTTCGAAAGA 

10 TCCCAACGAAAAGAGAGACCACATGGTCCTTCTTGAGTTTGTAACAGCTG 
CTGGGATTACACATGGCATGGATGAACTATACAAATAAATGTCCAGACTT 
CCAATTGACACTAAAGTGTCCGAACAATTACTAAAATCTCAGGGTTCCTG 
GTTAAATTCAGGCTGAGATATTATTTATATATTTATAGATTCATTAAAATT 
GTATGAATAATTTATTGATGTTATTGATAGAGGTTATTTTCTTATTAAACA 

15 GGCTACTTGGAGTGTATTCTTAATTCTATATTAATTACAATTTGATTTGACT 
TGCTCAAA 

SEQ. ID. NO. 22: EGFP + Linker 

Nucleic acid sequence start and stop codons are highlighted and a 12 amino acid 

20 linker sequence is denoted in lower case: 

gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT 
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC 
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA 
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC 

25 CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC 
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG 
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA 
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG 
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC 

30 TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG 
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG 
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC 
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT 
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG 
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TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGC 
GGCCGC 

SEQ. ID. NO. 23: Emerald 
5 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG 
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTGACCT 
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGAC 

10 TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA 
AGGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC 

15 AAGACCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACT 
ACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAAC 
CACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCG 
CGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCG 
GCATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 24: Topaz 

ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 

CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG 

GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 

25 CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCT 
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGCGCCAGCACGAC 
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 

30 GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA 
ACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC 
AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA 
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC 
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GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG 
CATGGACGAGCTGTACAAGTAA 

SEQ. ID. NO. 25: W1B 
5 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 
CGAGCTGGACGGCGACGTAAACGGCCACAGGTTCAGCGTGTCCGGCGAG 
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT 
GGGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC 

10 TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGTACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACATCAGCCACA 
ACGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGCCCACTTC 

1 5 AAGATCCGCCACA ACATCG AGGACGGCAGCGTGCAGCTCGCCG ACCACTA 
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC 
GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG 
CATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 26. Mouse MCHIR-Iinker-EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, as well as intron borders, are highlighted and a 12 amino acid 
linker sequence is denoted in lower case: 

25 ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT 
CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC 
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG 
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA 
GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA 

30 CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC 
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC 
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG 
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT 
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC 

35 TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC 
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AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT 
TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT 
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT 
GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT 
5 ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA 
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC 
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT 
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC 
ACACACACACACACACACACACACACACACACACACACACACACACAAA 

1 0 TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGG ATATG A 
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA 
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA 
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA 
GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG 

15 ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG 
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG 
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC 
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG 
GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG 

20 GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT 
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG 
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG 
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC 
GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT 

25 CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA 
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG 
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA 
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC 
TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT 

30 CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT 
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTAGGTGCTGCAG 
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC 
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG 
TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA 

35 GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG 
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ACGAGGAGAGGACAGAAAGCAAAGGCACCgtcgacggtaccgcgggcccgggatccatcg 
ccaccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCC 
TGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGC 
GAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTG 
5 CACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGA 
CCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCAC 
GACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCAT 
CTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCG 
AGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAG 

10 GAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCC 
ACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAA 
CTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACC 
ACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGAC 
AACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAA 

1 5 GCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTC 
TCGGCATGGACGAGCTGTACAAGTAA 

SEQ. ID. NO. 27: Mouse MCH1R/EGFP direct fusion 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 

20 EGFP, respectively, as well as intron borders, are highlighted: 

ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT 
CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC 
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG 
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA 

25 GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA 
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC 
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC 
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG 
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT 

30 GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC 
TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC 
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT 
TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT 
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT 

35 GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT 
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ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA 
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC 
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT 
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC 
5 ACACACACACACACACACACACACACACACACACACACACACACACAAA 
TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA 
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA 
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA 
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA 

10 GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG 
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG 
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG 
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC 
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG 

1 5 GGAAACTCC AC AGTC ATTTTTGCCGTGGTGAAG A A ATCC A AGCTGC ACTG 
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT 
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG 
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG 
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC 

20 GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT 
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA 
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG 
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA 
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC 

25 TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT 
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT 
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG 
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC 
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG 

30 TACATAGTACTCTGTG AGACCTTTCGA AAACGCTTGGTG CTGTCGGTGAA 
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG 
ACGAGGAGAGGACAGAAAGCAAAGGCACCATGGTGAGCAAGGGCGAGG 
AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA 
AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCT 

35 ACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTG 

-29- 



WO 01/68706 



PCT/USO 1/080 71 



CCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAG 
CCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGC 
CCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAAC 
TACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACC 
5 GCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG 
GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCG 
ACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT 
CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC 
ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCA 
10 GTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGC 
TGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTAC 
AAGTAA 

SEQ. ID. NO. 28: Human short form/mouse species chimeric MCHlR-linker- 
15 EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in 
lower case: 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 

20 CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA 

25 CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG 
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT 

30 CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG 
ATACTGATCITrACTGGTTCACrCTGTATCAGTTTTTCCTGGCCTTCGCCCT 
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC 
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA 
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG 

35 CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC 
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CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC 
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA 
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT 
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC 
5 gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT 
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC 
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA 
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC 
CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC 

10 CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG 
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA 
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG 
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC 
TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG 

1 5 AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCAC A ACATCGAGGACG 
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC 
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT 
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG 
TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA 

20 

SEQ. ED. NO. 29: Human long form/mouse species chimeric MCHIR-Iinker- 
EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in 
25 lower case: 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 

30 CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT 
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 

35 CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 
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CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA 
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC 
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC 
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC 
5 CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC 
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA 
AACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT 
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC 
GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG 

10 ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG 
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC 
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT 
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT 
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG 

1 5 CACGGTCAGCAATGCTC AGACAGCTGACGAGGAGAGGACAG A AAGCAAA 
GGCACCgtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGA 
GCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAA 
ACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAC 
GGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCC 

20 CTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCC 
GCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCC 
GAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA 
CAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGC 
ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGC 

25 ACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGAC 
AAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCG 
AGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATC 
GGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTC 
CGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGG 

30 AGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAG 
TAA 

Example 2: Generation of Chimeric and Fusion Proteins 

DNA vectors encoding fusion proteins between a MCH-R receptor 
35 (MCH1R) and several different superbright variants of Green Fluorescent Protein 
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(GFP) were generated. GFP variants were fused either via a 12 amino acid linker: 
TCGACGGTACCGCGGGCCCGGGATCCATCGCCACC (SEQ. ID. NO. 30), 
amino acid sequence: VDGTAGPGSIAT (SEQ. ID. NO. 31) (linker fusions) or 
directly to the C-terminus of MCH1R (direct fusions). 

5 

Mouse MCHlR-linker-GFP Variant Fusion Constructs 
Initially, mouse MCH1R was fused in frame via the linker to Enhanced 
Green Fluorescent Protein (EGFP). MCH1R was PCR-ampIified (95°C for 5 
minutes; 95°C for 30 seconds, 60°C for 45 seconds, 68°C for 3.5 minutes, for 15 

10 cycles; 68°C for 7 minutes) from a full-length mouse MCH1R genomic DNA lambda 
clone utilizing a high fidelity polymerase mix (Expand High Fidelity PCR System 
from Boehringer Mannheim) and PCR primers [MCH1R (Eco RI) 5': 
GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32), MCH1R (Sal I) 
3': GCGTCGACGGTGCCTTTGCTTTCTGTCC (SEQ. ID. NO. 33)] that generated 

15 Eco RI and Sal I enzymatic restriction sites at the N- and C-terminus, respectively. 
The MCH1R N-terminal PCR primer was also designed to introduce a Kozak 
consensus sequence for translation which contained an Nco I site (5'-ACCATGG-3'), 
and the MCH1R C-teiminal PCR primer was also designed to eliminate the 
endogenous stop codon present in the mouse MCH1R gene. The resulting PCR 

20 product was phenol/chloroform extracted, restriction digested with Eco RI and Sal I, 
gel purified, and subcloned in frame into the multicloning site of Clontech's pEGFP- 
N3 vector between Eco RI and Sal I sites. Several resulting clones for this construct 
were sequenced to identify a clone with an entirely correct nucleotide sequence. This 
clone was named mMCHlR-l-EGFP for mouse MCHlR-linker-EGFP. 

25 An approximately 760 bp Sal I to Not I fragment of mMCHlR-l-EGFP 

was excised, gel purified, and subcloned into the multicloning site of pBluescript 
(SK+) (Stratagene) between Sail and Not I sites. An approximately 710 bp Nco I to 
Bsr Gl fragment of EGFP was excised from the resulting pBluescript-EGFP vector 
and replaced with the corresponding Nco I to Bsr Gl fragment of either Emerald, 

30 Topaz, or W1B (other superbnght GFP variants), which were excised from vectors 
pRSET-Emerald, pRSET-Topaz, and pRSET-WlB, respectively. p.RSET-Emerald, 
pRSET-Topaz, and pRSET-WlB were obtained from Aurora Biosciences Co. Sal I to 
Not I fragments containing either Emerald, Topaz, or W1B were excised from the 
resulting pBluescript-Emerald, pBluescript-Topaz, and pBluescript-WlB vectors, 

35 respectively. Appropriate fragments were gel purified and subcloned into mMCHlR- 
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1-EGFP digested with Sal I and Not I, replacing the Sal I to Not I EGFP fragment with 
the corresponding Sal I to Not I fragment from either Emerald, Topaz, or W1B. 
Several clones for each construct were sequenced to confirm the presence of the 
appropriate GFP variant. The resulting vectors were named mMCHlR-l-Emerald, 
5 mMCHlR-l-Topaz, and mMCHlR-l-WlB for mouse MCHlR-linker-Emerald, 
mouse MCHlR-linker-Topaz, and mouse MCHlR-linker-WlB, respectively. 

Mouse MCH1R/GFP Variant Direct Fusion Constructs 
A two step PCR strategy was employed to generate the direct fusion 

10 constructs. First, mouse MCH1R, EGFP, and Emerald were PCR-amplified from a 
full-length mouse MCH1R genomic DNA lambda clone, Clontech's pEGFP-N3 
vector, and Aurora's pRSET-Emerald vector, respectively. Mouse MCH1R was 
PCR-amplified according to the previously mentioned conditions utilizing the same 
N-terminal PCR primer [MCH1R (Eco RI) 5': GCGAATTCACCATGGATCTGCA 

15 AGCCTCG (SEQ. ID. NO. 32)], but in this case a different C-terminal PCR primer 
was employed. The C-terminal PCR primer [MCH1R (EGFP/Emerald) 3 5 : 
CCTTGCTCACCATGGTGCCTTTGCTTTCTGTCC (SEQ. ID, NO. 34)] eliminated 
the endogenous stop codon of mouse MCH1R as before and introduced a region of 
nucleotide sequence complementary to the nucleotide sequence of the N-terminus of 

20 EGFP. 

EGFP and Emerald were PCR-amplified (95°C for 5 minutes; 95°C for 
30 seconds, 60°C for 45 seconds, 68°C for 1.5 minutes, for 15 cycles; 68°C for 7 
minutes) separately with a high fidelity polymerase mix (Advantage HF-2 from 
Clontech) from their respective templates utilizing a common N-teiminal PCR primer 

25 [EGFP/Emerald (MCH1R) 5': CAGAAAGCAAAGGCACCATGGTGAGCAA 
GGGCGAGGAGC (SEQ. ID. NO. 35)] that generated a region of nucleotide 
sequence complementary to the C-terminus of mouse MCH1R and C-terminal PCR 
primers [EGFP 3': GGCGGATCCTCTAGAGTCGCGGCC (SEQ. ID. NO. 36), or 
Emerald (EGFP) 3': GCTCTAGAGTCGCGGCCGCTTACTTGTACAGCTCGTCC 

30 (SEQ. ID. NO. 37)] that generated a Not I site at the C-terminus. The resulting PCR 
products were electrophoresed on an agarose gel and the appropriatefragments were 
gel purified. 

In a second PCR step, PCR reactions were set up between the 
previously generated mouse MCH1R and EGFP, or mouse MCH1R and Emerald 
35 PCR products. Following an initial 5 minute denaturation step at 95°C, two rounds of 
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thermocycling (95°C for 30 seconds, 60°C for 45 seconds, 68°C for 4 minutes) were 
performed in the absence of PCR primers. This allowed the mouse MCH1R and GFP 
variants to anneal at their complementary regions and to be filled in by the high 
fidelity polymerase mix (Expand High Fidelity PCR System from Boehringer 
5 Mannheim), yielding double stranded template DNA. 

Subsequently, the common N-terminal mouse MCH1R [MCH1R (Eco 
RI) 5': GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32)] and 
appropriate C-terminal PCR primers [EGFP 3': GGCGGATCCTCTAGAGTC 
GCGGCC (SEQ. ID. NO. 36) or Emerald (EGFP) 3': GCTCTAGAGTCGCGG 

10 CCGCTTACTTGTACAGCTCGTCC (SEQ. ID. NO. 37)] were added to the 

reactions and thermocycling was continued for an additional fifteen cycles followed 
by a final extension at 68°C for 7 minutes. The resulting PCR products were 
phenol/chloroform extracted, restriction digested with Eco RI and Not I, 
electrophoresed on an agarose gel, and appropriate fragments were gel purified. 

15 These Eco RI to Not I fragments represent direct fusions between 

either mouse MCH1R and EGFP, or mouse MCH1R and Emerald. Clontech's 
pEGFP-N3 vector was restriction digested with Eco RI and Not I liberating an 
approximately 780 bp Eco RI to Not I EGFP fragment. This restriction digest was 
electrophoresed on an agarose gel and the approximately 3.9 Kb pEGFP-N3 vector 

20 backbone was gel purified. Eco RI to Not I mouse MCH1R/EGFP or mouse 

MCHIR/Emerald direct fusion fragments were subcloned into the pEGFP-N3 vector 
backbone between Eco RI and Not I sites. Several resulting clones for each of these 
two constructs were sequenced to identify clones with correct nucleotide sequence; 
however, no clones with entirely correct nucleotide sequences were identified. 

25 Fortunately, several clones for each of the two constructs only had nucleotide 

mismatches in the intron region of mouse MCH1R, and therefore, were not expected 
to effect the functionality of the resulting fusion proteins. These clones were named 
mMCHlR/EGFP and mMCHlR/Emerald for mouse MCH1R/EGFP direct fusion and 
mouse MCHIR/Emerald direct fusion, respectively. 

30 

Human Short and Long Form/Mouse Species Chimeric 

MCHIR-Iinker-GFP Variant Fusion Constructs 
The initial mouse MCHIR-Iinker-GFP variant fusion constructs were 
modified to generate both human short form and human long form/mouse species 
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chimeric MCHlR-linker-GFP variant fusion constructs. An approximately 1.7 kb 
Hind IE to Bsp EI fragment of the mouse MCH1R gene containing exon 1, the intron, 
and 127 amino acids of exon 2 was excised from the various mouse MCHlR-linker- 
GFP variant fusion constructs and replaced by either an approximately 470 bp Hind 
5 HI to Bsp EI fragment from the wild-type human MCH1R short form or an 

approximately 670 bp Hind HI to Bsp EI fragment from the wild-type human MCH1R 
long form. 

Several clones for each construct were sequenced to confirm the 
presence of the N-terminal region of either the human MCH1R short or long forms. 
10 These clones were named hshort/mMCHlR-l-GFP variant or hlong/mMCHlR-l-GFP 
variant for human short form/mouse species chimeric MCHlR-linker-GFP variant 
and human long form/mouse species chimeric MCHlR-linker-GFP variant, 
respectively. 

15 Example 3: Functional Evaluation of MCH1R/GFP Variant Fusion Proteins 

Both HEK293 Aequorin (National Institutes of Health) and CHO . 
mammalian cell lines were transiently transfected with the various MCH1R/GFP 
variant fusion constructs, as well as the appropriate control constructs. Transfection 
was performed using Lipofectamine 2000 (Gibco BRL) per the manufacturer 

20 recommended protocol. Approximately 48 hours after transfection cells were 

harvested, stimulated with various concentrations of human MCH, and assayed for 
either aequorin bioluminescence (HEK293 Aequorin cells) or cAMP production 
(CHO cells). Aequorin bioluminescence is a representative measure of intracellular 
Ca 2+ mobilization. cAMP production was measured with the Adenylyl Cyclase 

25 Activation FlashPlate Assay (NEN Life Science Products, Inc.). 

Following transient transfection of the mMCHlR-linker-EGFP 
construct (MCH-R-1-EGFP) into HEK293 Aequorin cells, the resulting fusion protein 
exhibited functional activity comparable to that of the wild-type human MCH1R short 
form (MCH-R wt). By this functional assay, the EC50 value for mMCHlR-I-EGFP 

30 was nearly identical to that of the wild-type human short form receptor (Figure 1). 

Following transient transfections of the mMCHlR-l-EGFP and 
mMCHlR/EGFP fusion constructs into CHO cells, the resulting fusion proteins 
exhibited functional activity comparable to that of the wild-type human MCH1R short 
form. By this functional assay, the EC50 values for mMCHlR-l-EGFP and 

35 mMCHlR/EGFP were comparable to that of the wild-type human receptor (Table 1). 
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Transient transfections with the corresponding Emerald constructs yielded similar 
results (data not shown). 



Table 1 

5 



Receptor 


EC50 (nM) 


Wild-type Human MCH1R Short Form 


2.166 


Mouse MCH1R/EGFP 


0.819 


Mouse MCH1R-1-EGFP 


3.199 



Following transient transfections of the human short form/mouse 
species chimeric MCH1R-1-EGFP (HuShoit/mMCHlR-l-EGFP) and human long 
form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHlR-l-EGFP) 
10 constructs into HEK293 cells, the resulting fusion proteins exhibited functional 
activity comparable to that of the wild-type human MHC1R short and long forms, 
respectively. By this functional assay, the EC50 value for each fusion proteins was 

nearly identical to that of the corresponding wild-type human receptor (Table 2). 
15 Table 2 



Receptor 


EC50 (nM) 


Wild-type Human MCH1R Short Form 


22.27 


HuShort/mMCHlR-l-EGFP 


19.54 


Wild-type Human MCH1R Lone Form 


196.7 


HuLon e/ mMCH 1R-1-EGFP Form 


217.5 



Following transient transfections of the human short form/mouse 
species chimeric MCH1R-1-EGFP (HuShort/mMCHlR-1-EGFP) and human long 
20 form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHlR-l-EGFP) 

constructs into CHO cells, the resulting fusion proteins exhibited functional activity 
comparable to or less than that of the wild-type human MHC1R short and long forms, 
respectively (Table 3). By this functional assay, the EC50 value for the human short 

fonn/mouse species chimeric MCH1R-1-EGFP fusion protein was comparable to that 
25 of the corresponding wild-type human receptor, whereas, the human long form/mouse 
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species chimeric MCH1R-1-EGFP fusion protein had an EC50 value approximately 
7.5-fold higher than that of its corresponding wild-type control. 



Table 3 

5 



Receptor 


EC50 (nM) 


Wild-type Human MCH1R Short Form 


1.029 


Wild-type Human MCH1R Long Form 


1.515 


HuShort/mMCHlR-l-EFGP 


1.565 


HuLong/mMCHlR-l-EGFP 


11.580 



Transient expression of all the MCH1R/GFP variant fusion proteins 
that underwent functional evaluation resulted in fluorescence primarily associated 
with the plasma membrane in both HEK293 and CHO cells (data not shown). This 
10 pattern of fluorescence is consistent with a predominant membrane associated 
localization. 

Example 4: Generation of Stable Cell Lines 

Wild-type CHO cells were transfected using SuperFect (Qiagen) and 

15 either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R- 
EGFP. Forty-eight hours after transfection, transfected cells were subjected to 
positive selection for approximately ten days in media containing G418. Following 
selection, MCH-1R-EGFP expressing CHO cells were bulk sorted by Fluorescence 
Assisted Cell Sorting (FACS) for one or two rounds on the basis of fluorescence 

20 intensity to increase the population of cells expressing EGFP. Following bulk sorts, 
individual clones of varying fluorescence intensities were isolated by FACS and 
expanded. 

Fluorometric Microvolume Assay Technology (FMAT) was initially 
employed to screen a large number of stable clones by whole cell binding with a 
25 fluorescently labeled MCH derivative (SymJz-MCH, PE Biosystems) to identify those 
clones with good specific binding windows. Several clones exhibiting specific 
binding windows greater than 3 -fold were further evaluated for MCH binding with the 
SPA-based Binding Assay. Cells from individual clones were dissociated in enzyme 
free dissociation media and cell membranes were prepared and subsequently tested for 
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their ability to bind [ 125 I]Phe 13 Tyr 19 -MCH in the presence of human MCH. CHO cell 
lines expressing either mouse MCH-1R-EGFP or human short/mouse species 
chimeric MCH-1R-EGFP (Figure 4) displayed IC50 values with MCH that were 
indistinguishable from the corresponding IC50 values obtained with a CHO cell line 
5 expressing the wild-type human short isoform of MCH-1R. 

The functional activity of these clones was evaluated with the cAMP 
Flashplate Assay (Figures 2 and 3). CHO cell lines expressing either mouse MCH- 
1R-EGFP (Figure 2) or human short/mouse species chimeric MCH-1R-EGFP (Figure 
3) displayed EC50 values with human MCH that were indistinguishable from the 

10 EC50 value obtained with a CHO cell line expressing the wild-type human short 
isoform of MCH-1R. 

The subcellular localization of the MCH-1R-EGFP fusion proteins 
were determined by confocal microscopy utilizing EGFP fluorescence as a marker for 
MCH-1R expression. CHO cell lines stably expressing either mouse MCH-1R-EGFP 

15 or human short/mouse species chimeric MCH-1R-EGFP displayed EGFP 

fluorescence primarily associated with the plasma membrane, demonstrating that 
these MCH-1R-EGFP fusion proteins are primarily associated with the plasma 
membrane. 

Other embodiments are within the following claims. While several 
20 embodiments have been shown and described, various modifications may be made 
without departing from the spirit and scope of the present invention. 
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WHAT IS CLAIMED IS: 



1 . A fusion protein comprising: 

a) a melanin concentrating hormone receptor polypeptide 
5 region comprising a sequence selected from the group consisting of: SEQ. ID. NO. 1, 

SEQ. ID. NO. 2, SEQ. SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. NO. 5; and 

b) a fluorescent polypeptide region joined directly, or 
though a linker, to the carboxy side of said melanin concentrating hormone receptor 
polypeptide region. 

10 

2. The protein of claim 1, wherein said fluorescent polypeptide 
region consists of an amino acid sequence selected from the group consisting of SEQ. 
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO. 10. 



15 3. The protein of claim 2, wherein said melanin concentrating 

hormone polypeptide region consists of a sequence selected from the group consisting 
of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. 
NO. 5. 

20 4. The protein of claim 3, wherein said protein consists essentially 

of said melanin concentrating hormone receptor polypeptide region and said 
fluorescent polypeptide region. 

5. The protein of claim 4, wherein said protein consists of the 
25 amino acid sequence of SEQ. ID. NO. 11 or SEQ. ID. NO. 12. 



6. The protein of claim 1, wherein said melanin concentrating 
hormone polypeptide region is a chimeric polypeptide comprising (a) an MCH 
binding region from a first species and (b) a transmembrane and intracellular domain 
30 region from a second species joined directly, or though a linker, to the carboxy side of 
said MCH binding region. 



7. The protein of claim 6, wherein said fluorescent polypeptide 
region consists of an amino acid sequence selected from the group consisting of: SEQ. 
35 ED. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO. 10. 
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8. The protein of claim 7, wherein said protein consists of the 
amino acid sequence of SEQ. ID. NO. 13 or SEQ. ID. NO. 14. 

5 9. A chimeric melanin concentrating hormone protein comprising: 

a) a melanin concentrating hormone binding region 
characteristic of a human melanin concentrating hormone receptor; 

b) a transmembrane domain characteristic of a non-human 
melanin concentrating hormone receptor; and 

10 c) an intracellular domain characteristic of a non-human 

melanin concentrating hormone receptor. 

10. The protein of claim 9, wherein said protein comprises a 
melanin concentrating hormone receptor polypeptide having a sequence similarity of 

15 at least 75% with either SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

11. The protein of claim 10, wherein said protein comprises the 
sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

20 12. The protein of claim 11, wherein said protein consists of the 

sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

13. A nucleic acid comprising a nucleotide sequence encoding for 
the protein of claim 1. 

25 

14. The nucleic acid of claim 13, wherein said nucleotide sequence 
is a contiguous sequence. 

15. The nucleic acid of claim 13, wherein said nucleotide sequence 
30 is selected from the group consisting of SEQ. ID. NO. 26, SEQ. ID. NO. 27, SEQ. ID. 

NO. 28 and SEQ. ID. NO. 29. 

16. A nucleic acid comprising a nucleotide sequence encoding for 
the protein of claim 9. 

35 
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17. The nucleic acid of claim 16, wherein said nucleotide sequence 
is a contiguous sequence. 

18. The nucleic acid of claim 16, wherein said nucleotide sequence 
5 is selected from the group consisting of SEQ. ID. NO. 19 and SEQ. ID. NO. 20. 

19. An expression vector comprising the nucleic acid of claim 13. 

20. An expression vector comprising the nucleic acid of claim 16. 

10 

21. A recombinant cell comprising the nucleic acid of claim 13. 

22. The recombinant cell of claim 21, wherein said nucleic acid is 
present in an expression vector. 

15 

23. The recombinant cell of claim 21, wherein said nucleic acid is 
present in the genome of said cell. 

24. A recombinant cell comprising the nucleic acid of claim 16. 

20 

25. The recombinant cell of claim 24, wherein said nucleic acid is 
present in an expression vector. 

26. The recombinant cell of claim 24, wherein said nucleic acid is 
25 present in the genome of said cell. 

27. A non-human transgenic animal comprising the nucleic acid of 

claim 13. 

30 28. A non-human transgenic animal comprising the nucleic acid of 

claim 16. 

29. A method for assaying for melanin concentrating hormone 
receptor active compounds comprising the steps of: 
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a) contacting the cell of claim 21 with a test preparation 
comprising one or more test compounds; and 

b) measuring the effect of said test preparation on one or 
more melanin concentrating hormone receptor activities. 

5 

30. A method for assaying for melanin concentrating hormone 
receptor active compounds comprising the steps of: 

a) contacting the cell of claim 24 with a test preparation 
comprising one or more test compounds; and 
10 b) measuring the effect of said test preparation on one or 

more melanin concentrating hormone receptor activities. 
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WT hShort MCH-1R 
3C5 mMCH-1 R-EGFP 
3D2 mMCH-1 R-EGFP 
6A4 mMCH-1 R-EGFP 
6C3 mMCH-1 R-EGFP 
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13 
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3 

O 
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< 
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O 




WT hShort MCH-1 R 
11C3 hShort/m MCH-1 R-EGFP 
12A3 hShort/m MCH-1 R-EGFP 
12B4 hShort/m MCH-1 R-EGFP 
12B6 hShort/m MCH-1 R-EGFP 
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SEQUENCE LISTING 

<110> Merck & Co . , Inc. 

<120> MELANIN CONCENTRATING HORMONE RECEPTOR 
CHIMERIC AND FUSION PROTEINS 



<130> 20652 PCT 

<150> 60/189,698 
<151> 2000-03-15 

<160> 37 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 422 
<212> PRT 
<213> Human 



<400> 1 



Met 


Ser 


Val 


Gly 


Ala 


Met 


Lys 


Lys 


Gly Val 


Gly Arg Ala 


Val 


Gly 


Leu 


1 


















10 










15 




Gly 


Gly 


Gly 


Ser 


Gly 


Cys 


Gin 


Ala 


Thr 


Glu 


Glu 


Asp 


Pro 


Leu 


Pro 


Asn 








20 










25 










30 






Cys 


Gly 


Ala 


Cys 


Ala 


Pro 


Gly 


Gin 


Gly Gly 


Arg 


Arg 


Trp 


Arg 


Leu 


Pro 






35 










40 










45 








Gin 


Pro 


Ala 


Trp 


Val 


Glu 


GIv 


Ser 


Ser 


Ala 


Arg 


Leu 


Trp 


Glu 


Gin 


Ala 




50 










55 










60' 










Thr 


Gly 


Thr 


Gly 


Trp 


Met 


Asp 


Leu 


Glu 


Ala 


Ser 


Leu 


Leu 


Pro 


Thr 


Gly 


65 








70 










75 










80 


Pro 


Asn 


Ala 


Ser 


Asn 


Thr 


Ser 


Asp 


Gly 


Pro 


Asp 


Asn 


Leu 


Thr 


Ser 


Ala 










85 










90 










95 




Gly 


Ser 


Pro 


Pro 


Arg 


Thr 


Gly 


Ser 


He 


Ser 


Tyr 


He 


Asn 


He 


He 


Met 






100 










105 










110 






Pro 


Ser 


Val 


Phe 


Gly 


Thr 


He 


Cys 


Leu 


Leu 


Gly 


He 


He 


Gly 


Asn 


Ser 






115 










120 










125 








Thr 


Val 


He 


Phe 


Ala 


Val 


Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


Trp 


Cys 


Asn 




130 










135 










140 










Asn 


Val 


Pro 


Asp 


He 


Phe 


He 


He 


Asn 


Leu 


Ser 


Val 


Val 


Asp 


Leu 


Leu 


145 








150 










155 










160 


Phe 


Leu 


Leu 


Gly 


Met 


Pro 


Phe 


Met 


He 


His 


Gin 


Leu 


Met 


Gly Asn 


Gly 








165 










170 










175 




Val 


Trp 


His 


Phe 


Gly 


Glu 


Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 


Ala 


Met 


Asp 








180 










185 










190 






Ala 


Asn 


Ser 


Gin 


Phe 


Thr 


Ser 


Thr 


Tyr 


He 


Leu 


Thr 


Ala 


Met 


Ala 


He 






195 










200 










205 








Asp 


Arg 


Tyr 


Leu 


Ala 


Thr 


Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


Lys 


Phe 


Arg 




210 










215 










220 










Lys 


Pro 


Ser 


Val 


Ala 


Thr 


Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 


Ala 


Leu 


Ser 


225 










230 










235 










240 


Phe 


lie 


Ser 


He 


Thr 


Pro 


Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 


lie 


Pro 


Phe 










245 










250 










255 




Pro 


Gly 


Gly 


Ala 


Val 


Gly 


Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


Pro 


Asp • 


Thr 








260 










265 










270 






Asp 


Leu 


Tyr 


Trp 


Phe 


Thr 


Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


Phe 


Ala 


Leu 




275 








280 










285 








Pro 


Phe 


Val 


Val 


He 


Thr 


Ala 


Ala 


Tyr 


Val 


Arg 


He 


Leu 


Gin 


Arg 


Met 



290 295 300 
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Thr 


Ser 


Ser 


Val 


TV I 

Ala 


Pro 


Aia 


Ser 


305 










Tin 

o 1U 






Lys 


Arg 


Val 


Thr 


Arg 


Thr 


Ala 


lie 








O <c 0 








Cys 


Trp 


Ala 


Pro 
340 


Tyr 


Tyr 


Val 


Leu 


Arg 


Pro 


Thr 


Leu 


Thr 


Phe 


Val 


Tyr 




355 










360 


Gly Tyr 


Ala 


Asn 


Ser 


Cys 


Leu 


Asn 




370 










375 




Glu 


Thr 


Phe 


Arg 


Lys 


Arg 


Leu 


Val 


385 










390 






Gly Glr. 


Leu 


Arg 


Ala 


Val 


Ser 


Asn 










405 








Thr 


Glu 


Ser 


Lys 
420 


Gly 


Thr 







<210> 2 
<211> 353 
<212> PRT 
<213> Human 

<400> 2 



Met 


Asp 


Leu 


Glu 


Ala 


Ser 


Leu 


Leu 


1 








5 








Thr 


Ser 


Asp 


Gly 


Pro 


Asp 


Asn 


Leu 








20 










Thr 


Gly 


Ser 


He 


Ser 


Tyr 


He 


Asn 






35 










40 


Thr 


lie 


Cys 


Leu 


Leu 


Gly 


lie 


He 




50 










55 




Val 


Val 


Lys 


Lys 


Ser 


Lvs 


Leu 


His 


65 










70 






Phe 


He 


He 


Asn 


Leu 


Ser 


Val 


Val 










85 








Pro 


Phe 


Met 


He 


His 


Gin 


Leu 


Met 








100 










Glu 


Thr 


Met 


Cys 


Thr 


Leu 


lie 


Thr 






115 










120 


Thr 


Ser 


Thr 


Tyr 


He 


Leu 


Thr 


Ala 




13 0 










135 




Thr 


Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


145 










150 






Thr 


Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 










165 








Pro 


Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 








180 










Gly 


Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 






195 










200 


Thr 


Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 




210 










215 




Thr 


Ala 


Ala 


Tyr 


Val 


Arg 


lie 


Leu 


225 










230 






Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


lie 


Arg 










245 








Thr 


Ala 


He 


Ala 


lie 


Cys 


Leu 


Val 








260 








Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 






275 










280 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 




290 










295 





uxn 


Arg 


Ser 


lie 


Arg 


Leu 


Arg 


1 ilL 






ji J 










J U 


Ala 


lie 


Cys 


Leu 


Val 


Phe 


Phe 


Val 




330 










335 




Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


lie 


Ser 


345 










350 






Leu 


Tyr 


Asn 


Ala 


Ala 


lie 


Ser 


Leu 










365 








Pro 


Phe 


Val 


Tyr 


lie 


Val 


Leu 


Cys 








380 










Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 






395 










400 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 


Glu 


Arg 




410 










415 





Pro 


Thr 


Gly Pro 


Asn 


Ala 


Ser 


Asn 




10 










15 




Thr 


Ser 


Ala Gly 


Ser 


Pro 


Pro 


Arg 


25 










30 






lie 


lie 


Met 


Pro 


Ser 


Val 


Phe 


Gly 










45 








Gly 


Asn 


Ser 


Thr 


Val 


He 


Phe 


Ala 








60 










Trp 


Cys 


Asn 


Asn 


Val 


Pro 


Asp 


lie 






75 










80 


Asp 


Leu 


Leu 


Phe 


Leu 


Leu 


Glv 


Met 




90 










95 




Gly 


Asn 


Gly 


Val 


Trp 


His 


Phe 


Gly 


105 










110 






Ala 


Met 


Asp 


Ala 


Asn 


Ser 


Gin 


Phe 










125 








Met 


Ala 


lie 


Asp 


Arg 


Tyr 


Leu 


Ala 








140 










Lys 


Phe 


Arg 


Lys 


Pro 


Ser 


Val 


Ala 






155 










160 


Ala 


Leu 


Ser 


Phe 


lie 


Ser 


lie 


Thr 




170 










175 




lie 


Pro 


Phe 


Pro 


Gly 


Gly Ala 


Val 


185 










190 






Pro 


Asp 


Thr 


Asp 


Leu 


Tyr 


Trp 


Phe 










205 








Phe 


Ala 


Leu 


Pro 


Phe 


Val 


Val 


lie 








220 










Gin 


Arg 


Met 


Thr 


Ser 


Ser 


Val 


Ala 






235 










240 


Leu 


Arg 


Thr 


Lys 


Arg 


Val 


Thr 


Arg 




250 










255 




Phe 


Phe 


Val 


Cys 


Trp 


Ala 


Pro 


•Tyr 


265 










270 






Ser 


lie 


Ser 


Arg 


Pro 


Thr 


Leu 


Thr 










285 








lie 


Ser 


Leu 


Gly 


Tyr 


Ala 


Asn 


Ser 








300 
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Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


Val 


Leu 


Cys 


Glu Thr Phe 


Arg 


Lys 


305 










310 










315 






320 


Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 


Gly Gin Leu 


Arg 


Ala 








325 










330 






335 




Val 


Ser 


Asn 


Ala 
340 


Gin 


Thr 


Ala 


Asp 


Glu 
345 


Glu 


Arg 


Thr Glu Ser 
350 


Lys 


Gly 



<2i0> 3 
<211> 353 
<212> PRT 
<213> Mouse 



<400> 3 



Met 


Asp 


Leu 


Gin 


Ala 


Ser 


Leu 


Leu 


Ser 


Thr 


Gly 


Pro 


Asn 


Ala 


Ser 


Asn 


1 






5 










1U 










Id 




lie 


Ser 


Asp 


Gly 
20 


Gin 


Asp 


Asn 


Pne 


Thr 

25 


Leu 


ax a 


Gly 


Pro 


Pro 
30 


Pro 


Arg 


Thr 


Arg 


Ser 


Val 


Ser 


Tyr 


lie 


Asn 


lie 


lie 


Met 


Pro 


Ser 


Val 


pne 






35 










40 










45 








Thr 


He 


Cys 


Leu 


Leu 


Gly 


lie 


Val 


Gly 


Asn 


Ser 


Thr 


val 


xie 


pne 


Ala 




50 


















b U 










Val 


Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


Trp 


Cys 


Ser 


Asn 


Val 


Pro 


Asp 


Tin 

lie 


65 










70 










75 










Q ft 

oU 


Phe 


He 


He 


Asn 


Leu 
85 


Ser 


Val 


Val 


Asp 


Leu 
90 


Leu 


Phe 


Leu 


Leu 


Gly 
95 


Met 


Pro 


Phe 


Met 


He 
100 


His 


Gin 


Leu 


Met 


Gly 
105 


Asn 


Gly 


Val 


Trp 


His 
110 


Pne 


Gly 


Glu 


Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 


Ala 


Met 


Asp 


Ala 


Asn 


Ser 


Gin 


Phe 






115 








120 










125 








Thr 


Ser 
130 


Thr 


Tyr 


He 


Leu 


Thr 
135 


Ala 


Met 


Ala 


He 


Asp 
140 


Arg 


Tyr 


Leu 


Ala 


Thr 


Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


Lys 


Phe 


Arg 


Lys 


Pro 


Ser 


Met 


Ala 


145 










150 










155 










160 


Thr 


Leu 


Val 


He 


Cys 
165 


Leu 


Leu 


Trp 


Ala 


Leu 
170 


Ser 


Phe 


He 


Ser 


He 
175 


Thr 


Pro 


Val 


Trp 


Leu 
180 


Tyr 


Ala 


Arg 


Leu 


He 
185 


Pro 


Phe 


Pro 


Gly 


Gly 
190 


Ala 


Val 


Gly 


Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


Pro 


Asp 


Thr 


Asp 


Leu 


Tyr 


Trp 


Phe 




195 










200 










205 








Thr 


Leu 
210 


Tyr 


Gin 


Phe 


Phe 


Leu 
215 


Ala 


Phe 


Ala 


Leu 


Pro 
220 


Phe 


Val 


Val 


He 


Thr 


Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


Gin 


Arg 


Met 


Thr 


Ser 


Ser 


Val 


Ala 


225 










230 










235 










240 


Pro 


Ala 


Ser 


Gin 


Arg 
245 


Ser 


He 


Arg 


Leu 


Arg 
250 


Thr 


Lys 


Arg 


Val 


Thr 
255 


Arg 


Thr 


Ala 


lie 


Ala 
260 


He 


Cys 


Leu 


Val 


Phe 
265 


Phe 


Val 


Cys 


Trp 


Ala 
270 


Pro 


Tyr 


Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


He 


Ser 


Arg 


Pro 


Thr 


Leu 


Thr 




275 










280 










285 








Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He 


Ser 


Leu 


Gly Tyr Ala 


Asn 


Ser 




290 










295 










300 










Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


Val 


Leu 


Cys 


Glu 


Thr 


Phe 


Arg 


Lys 


305 










310 










315 










320 


Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 


Gly Gin 


Leu 


Arg 


Thr 










325 










330 










335 




Val 


Ser 


Asn 


Ala 
340 


Gin 


Thr 


Ala 


Asp 


Glu 
345 


Glu 


Arg 


Thr 


Glu 


Ser 
350 


Lys 


Gly 



Thr 
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<210> 4 
<211> 353 
<2I2> PRT 

<2I3> Artificial Sequence 
<220> 

<223> Human short form/mouse species chimeric MCH1R 



<400> 4 



Met 


Asp 


Leu 


Glu 


Ala 


Ser 


Leu 


Leu 


Pro 


Thr 


Gly 


Pro 


Asn 


Ala 


Ser 


Asn 


1 






5 










10 










15 




Thr 


Ser 


Asp 


Gly 


Pro 


Asp 


Asn 


Leu 


Thr 


Ser 


Ala 


Gly 


Ser 


Pro 


Pro 


Arg 








20 










2 5 










30 






Thr 


Gly 


Ser 


He 


Ser 


Tyr 


He 


Asn 


He 


He 


Met 


Pro 


Ser 


Val 


Phe 


Gly 




35 










40 










45 








Thr 


He 


Cys 


Leu 


Leu 


Gly 


He 


He 


Gly 


Asn 


Ser 


Thr 


Val 


lie 


Phe 


Ala 




50 










55 










60 










Val 


Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


Trp 


Cys 


Asn 


Asn 


Val 


Pro 


Asp 


He 


65 










70 










75 










80 


Phe 


He 


He 


Asn 


Leu 


Ser 


Val 


Val 


Asp 


Leu 


Leu 


Phe 


Leu 


Leu 


Gly 


Met 










85 










90 










95 




Pro 


Phe 


Met 


He 


His 


Gin 


Leu 


Met 


Gly 


Asn 


Gly Val 


Trp 


His 


Phe 


Gly 








100 










105 










110 






Glu 


Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 


Ala 


Met 


Asp 


Ala 


Asn 


Ser 


Gin 


Phe 






115 










120 










125 








Thr 


Ser 


Thr 


Tyr 


He 


Leu 


Thr 


Ala 


Met 


Ala 


He 


Asp 


Arg 


Tyr 


Leu 


Ala 




130 










135 










140 










Thr 


Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


Lys 


Phe 


Arg 


Lys 


Pro 


Ser 


Met 


Ala 


145 










150 










155 










160 


Thr 


Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 


Ala 


Leu 


Ser 


Phe 


lie 


Ser 


lie 


Thr 










165 










1 / u 










1 /b 




Pro 


Val 


Trp 


Leu 


Tyr 


Ala 


Arg Leu 


He 


Pro 


Phe 


Pro Gly Gly 


Ala 


Val 








180 










185 










190 






Gly 


Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


Pro 


Asp 


Thr Asp Leu Tyr 


Trp 


Phe 






195 










200 










205 








Thr 


Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


Phe 


Ala 


Leu 


Pro 


Phe 


Val 


Val 


lie 




210 










215 










220 










Thr 


Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


Gin 


Arg 


Met 


Thr 


Ser 


Ser 


Val 


Ala 


225 










230 










235 










240 


Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 


Leu 


Arg 


Thr 


Lys 


Arg 


Val 


Thr 


Arg 










245 










250 










255 




Thr 


Ala 


He 


Ala 


He 


Cys 


Leu 


Val 


Phe 


Phe 


Val 


Cys 


Trp 


Ala 


Pro 


Tyr 








260 










265 










270 






Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


He 


Ser 


Arg 


Pro 


Thr 


Leu 


Thr 




275 










280 










285 








Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He 


Ser 


Leu Gly Tyr Ala 


Asn 


Ser 




290 










295 










300 










Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


lie 


Val 


Leu 


Cys 


Glu 


Thr 


Phe 


Arg 


Lys 


305 










310 










315 










320 


Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 


Gly 


Gin 


Leu 


Arg 


Thr 








325 










330 










335 




Val 


Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 


Glu 


Arg 


Thr 


Glu 


Ser 


Lys 


Gly 



340 345 350 



Thr 
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<223> Human long form/mouse species chimeric MCK1R 



<400> 5 



Met 


Ser 


Val 


Gly 


Ala 


Met 


Lys 


Lys 


Gly 


Val 


Gly 


Arg 


Ala 


Val 


Gly 


Leu 


1 








5 










10 










15 




Gly Gly 


Gly 


Ser 


Gly 


Cys 


Gin 


Ala 


Thr 


Glu 


Glu 


Asp 


Pro 


Leu 


Pro 


Asn 








20 










25 










30 






Cys 


Gly 


Ala 
35 


Cys 


Ala 


Pro 


Gly 


Gin 
40 


Gly 


Gly 


Arg 


Arg 


Trp 
45 


Arg 


Leu 


Pro 


Gin 


Pro 
50 


Ala 


Trp 


Val 


Glu 


Gly 
55 


Ser 


Ser 


Ala 


Arg 


Leu 
60 


Trp 


Glu 


Gin 


Ala 


Thr 


Gly 


Thr 


Gly 


Trp 


Met 


Asp 


Leu 


Glu 


Ala 


Ser 


Leu 


Leu 


Pro 


Thr 


Gly 


65 










70 










75 










80 


Pro 


Asn 


Ala 


Ser 


Asn 
85 


Thr 


Ser 


Asp 


Gly 


Pro 
90 


Asp 


Asn 


Leu 


Thr 


Ser 
95 


Ala 


Gly 


Ser 


Pro 


Pro 
100 


Arg 


Thr 


Gly 


Ser 


He 
105 


Ser 


Tyr 


lie 


Asn 


He 

110 


He 


Met 


Pro 


Ser 


Val 


Phe 


Gly 


Thr 


He 


Cys 


Leu 


Leu 


Gly 


He 


He 


Gly Asn 


Ser 






115 










120 










125 








Thr 


Val 
130 


He 


Phe 


Ala 


Val 


Val 
135 


Lys 


Lys 


Ser 


Lys 


Leu 
140 


His 


Trp 


Cys 


Asn 


Asn 


Val 


Pro 


Asp 


He 


Phe 


lie 


He 


Asn 


Leu 


Ser 


Val 


Val 


Asp 


Leu 


Leu 


145 










150 










155 










160 


Phe 


Leu 


Leu 


Gly 


Met 


Pro 


Phe 


Met 


He 


His 


Gin 


Leu 


Met 


Gly Asn 


Gly 










165 










170 










175 




Val 


Trp 


His 


Phe 
180 


Gly 


Glu 


Thr 


Met 


Cys 
185 


Thr 


Leu 


He 


Thr 


Ala 
190 


Met 


Asp 


Ala 


Asn 


Ser 
195 


Gin 


Phe 


Thr 


Ser 


Thr 
200 


Tyr 


He 


Leu 


Thr 


Ala 
205 


Met 


Ala 


He 


Asp 


Arg 
210 


Tyr 


Leu 


Ala 


Thr 


Val 
215 


His 


Pro 


He 


Ser 


Ser 
220 


Thr 


Lys 


Phe 


Arg 


Lys 


Pro 


Ser 


Met 


Ala 


Thr 


Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 


Ala 


Leu 


Ser 


225 










230 










235 










240 


Phe 


He 


Ser 


lie 


Thr 
245 


Pro 


Val 


Trp 


Leu 


Tyr 
250 


Ala 


Arg 


Leu 


He 


Pro 
255 


Phe 


Pro 


Gly 


Gly 


Ala 
260 


Val 


Gly 


Cys 


Gly 


lie 
265 


Arg 


Leu 


Pro 


Asn 


Pro 
270 


Asp 


Thr 


Asp 


Leu 


Tyr 
275 


Trp 


Phe 


Thr 


Leu 


Tyr 
280 


Gin 


Phe 


Phe 


Leu 


Ala 
285 


Phe 


Ala 


Leu 


Pro 


Phe 
290 


Val 


Val 


He 


Thr 


Ala 
295 


Ala 


Tyr 


Val 


Lys 


He 
300 


Leu 


Gin 


Arg 


Met 


Thr 


Ser 


Ser 


Val 


Ala 


Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


lie 


Arg 


Leu 


Arg 


Thr 


305 










310 










315 










320 


Lys 


Arg 


Val 


Thr 


Arg 
325 


Thr 


Ala 


He 


Ala 


He 
330 


Cys 


Leu 


Val 


Phe 


Phe 
335 


Val 


Cys 


Trp 


Ala 


Pro 


Tyr 


Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


He 


Ser 






340 










345 










350 






Arg 


Pro 


Thr 


Leu 


Thr 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He 


Ser 


Leu 




355 










360 










365 








Gly 


Tyr 


Ala 


Asn 


Ser 


Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


Val 


Leu 


Cys 




370 










375 










380 








Glu 


Thr 


Phe 


Arg 


Lys 


Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 


385 










390 










395 










400 


Gly Gin 


Leu 


Arg 


Thr 


Val 


Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 


Glu 


Arg 










405 










410 










415 




Thr 


Glu 


Ser 


Lys 
420 


Gly 


Thr 























<210> 6 

<2il> 238 

<212> PRT 

<213> Aequorea Victoria 
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<400> 6 














Met Ser 


Lys 


Gly Glu Glu 


Leu 


Phe 


1 




5 








Glu Leu 


Asp 


Gly Asp Val 


Asn 


Gly 






20 










Gly Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 




35 










40 


Thr Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


50 










c c 




Ser Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


65 








70 






Kis Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 








85 








Thr lie 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 






100 










Lys Phe 


Glu 


Gly Asp 


Thr 


Leu 


Vai 




115 










1 Oft 


Asd Phe 


Lys 


Glu Asp Gly 


Asn 


He 


130 










135 




Tyr Asn 


Ser 


His 


Asn 


Val 


Tyr 


lie 


145 








150 






lie Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 






165 








Gin Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 






180 










Val Leu 


Leu 


Pro 


Asp Asn 


His 


Tyr 




195 










200 


Lys Asd 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


210 










215 




Thr Ala 


Ala 


Gly 


He 


Thr 


His 


Gly 


225 








230 






<210> 7 














<211> 239 












<212> PRT 












<213> Artificial Sequence 




<220> 














<223> GF? derivative 






<400> 7 














Met Val 


Ser 


Lys 


Gly 
5 


Glu 


Glu 


Leu 


i 

Val Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 






20 










Glu Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 




35 










40 


Cys Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


50 










55 




Leu Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


65 








70 






Gin His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 








85 








Arg Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 






100 










Val Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 




115 










120 


lie Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


130 










135 




Asn Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


145 








150 







111- 


Gly Val 


val 


Pro 


Tip 

lie 


Leu 


Val 




10 










i o 




niS 


Lys 


Phe 


Ser 


vai 


Ser 


uiy 




O R 
/SO 










O u 






Lys 


Leu 


Thr 


Leu 


Lys 


irne 


lie 


cys 


















Trp 


Pro 


Thr 


Leu 


Val 


1 111 


i in 


- Iltr 








O u 










Arg 


Tyr 


Pro 


Asp 




Mot- 
tit. U 


Lys 


o_n 






75 










ft n 


Pro 


Glu 


Gly 


Tyr 


Val 


Pi rt 
Ljlii 


fin 
ol U. 






90 














Asn 


Tyr 


Lys 


i n_ 


Arc 


-Hi a 


ul — 


\7a i 

val 


105 










1 1 \j 






Asn 


Arg 


He 


uJ.ll 




Lys 




lie 










125 








Leu 


Gly His 


Lys 


Leu 


Glu 


Tyr 


Asn 








140 










Met Ala Asp 


Lys 


Gin 


Lys 


Asn 


Gly 






155 










160 


His 


Asn 


He 


IjlU 


Asp 


oiy 


Car 
»D"l 


Val 
v al 




170 










175 




Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 


Pro 


185 










190 






Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 










205 








His 


Met 


Val 


Leu 


Leu 


VjlU 


-file 


\/ a l 
val 








00 (\ 










Met 


Asp 


Glu 


Leu 


Tyr 


Lys 










235 












Phe 


Thr 


Gly 


Val 


val 


Pro 


lie 


Leu 




10 










lb 




Gly Kis Lys 


Phe 


Ser 


val 


Ser 


biy 


25 










J U 






Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


lie 










45 








Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 








60 










Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 






75 










80 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 




90 










95 




Gly Asn' Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


105 










110 






Val 


Asn Arg 


He 


Glu 


Leu 


Lys 


Gly 










125 








He Leu Gly 


His 


Lys 


Leu 


Glu 


Tyr 








140 










He 


Met 


Ala 

1 c;t; 


Asp 


Lys 


Gin 


Lys 


Asn 



155 160 
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Gly He Lys Val 


Asn 


Phe Lys He Arg His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 










165 


170 










175 




Val 


Gin 


Leu 


Ala 


Asp Kis Tyr Gin Gin Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 








180 




185 








190 






Pro 


Val 


Leu 


Leu 


Pro 


Asp Asn His Tyr Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 






195 






200 






205 








Ser 


Lys 


Asp 


Pro 


Asn 


Glu Lys Arg Asp His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 




210 








215 




220 










Val 


Thr 


Ala 


Ala 


Gly 


He Thr Leu Gly Met 


Asp 


Glu 


Leu 


Tyr 


Lys 





225 230 235 



<210> 8 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> G?P derivative 



<400> 8 














Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


1 








5 








Val 


Glu 


Leu Asp 


Gly 


Asp 


Val 


Asn 








20 










Glu Gly 


Glu Gly Asp 


Ala 


Thr 


Tyr 






35 










40 


Cys 


Thr 


Thr 


Gly Lys 


Leu 


Pro 


Val 




50 










55 




Leu 


Thr 


Tyr 


Gly Val 


Gin 


Cys 


Phe 


65 










70 






Gin 


His 


Asp 


Phe 


Phe 
85 


Lys 


Ser 


Ala 


Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp Asp 








100 










Val 


Lys 


Phe 


Glu Gly 


Asp 


Thr 


Leu 






115 










120 


lie 


ASD 


Phe 


Lys 


Glu 


Asp 


Gly Asn 




130 










135 




Asn 


Tyr 


Asn 


Ser 


His 


Lys 


Val 


Tyr 


145 










150 






Gly lie 


Lys 


Val 


Asn 


Phe 


Lys 


Thr 










165 








Val 


Gin 


Leu 


Ala 
180 


Asp 


His 


Tyr 


Gin 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 






195 








200 


Ser 


Lys 
210 


Asp 


Pro 


Asn 


Glu 


Lys 
215 


Arg 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


225 










230 







Phe 


Thr 


Gly 


Val 


Val 


Pro 


He 


Leu 




10 










15 




Gly His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 


25 










30 






Gly Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 










45 








Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 








60 










Ala 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 






75 










80 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 




90 










95 




Gly Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


105 










110 






Val 


Asn 


Arg 


lie 


Glu 


Leu 


Lys 


Gly 










125 








lie 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 








140 










lie 


Thr 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 






155 










160 


Arg 


His 


Asn 


lie 


Glu 


Asp 


Gly 


Ser 




170 










175 




Gin 


Asn 


Thr 


Pro 


lie 


Gly 


Asp 


Gly 


185 










190 






Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 










205 








Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 








220 










Gly Met 


Asp 


Glu 


Leu 


Tyr 


Lys 





235 



<210> 9 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 



<400> 9 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 
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Vai. 


pin 

ulu 


Leu 


Asp 


Gly Asp 


v ai 


Asn Gly 








20 








25 


fZl n 
<j 1 u 




Glu Gly 


Asp 


Ala 


Thr 


Tvr Glv 




35 










40 


Cys 


Tnr 


Thr Gly 


Lys 


Leu 


i- \j 


Val Pro 


D\J 










;c 




rne 


pi,, 
Vaiy 


Tyr 


Gly 


Val 


Gin 


W = 


Phe Ala 


b b 










70 






ply, 

bin 


nlS 


Asp 


Phe 


Phe 
85 


Lys 


Cor 


Al a Met 


Arg 


i nr 


He 


Phe 


Phe 


Lys 


Asp 








100 








105 


v ci J- 


Ly s 


Phe 


Glu 


Gly Asp 


Thr 


Leu Val 






115 












lie 


Asp 


Phe 


Lys 


Glu Asp 


uiy 


Asn Xle 




130 










135 




Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Vai 


Tyr lie 


145 










150 






Gly 


He 


Lys 


Val 


Asn 


Phe 


~» y o 


He Arg 










165 








Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin Gin 








180 








185 


Pro 


Val 


Leu 


Leu 


Pro Asp 


Asn 


His Tyr 






195 










200 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg Asp 




210 










215 




Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


lieu (j±y 


225 










230 






<210> 10 












<211> 239 












<212> PRT 












<213> Artificial Sequence 




<220> 














<223> GF? derivative 






<400> 10 












Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


iicu. - xitr 


1 








5 








Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Vai 


A o r* n ~\ \r 








20 








Z D 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


iyi vj-iy 






35 










/ n 
4t u 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val IriO 




50 










55 




Leu 


Thr 


Trp 


Gly 


Val 


Gin 


Cys 




65 










70 






Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala Met 










85 








Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp Gly 








100 








105 


Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu Val 






115 










120 


He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn He 




130 










135 




Asn 


Tyr 


lie 


Ser 


His 


Asn 


Val 


Tyr He 


145 










150 






Gly 


He 


Lys 


Ala 


His 


Phe 


Lys 


lie Arg 










165 








Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin Gin 








180 






185 



PCT/USO1/08O71 



His 


Lys 


Phe 


Ser 


Vai 


Ser 


Gly 


Lys 


Leu 


Thr 


Leu 

** j 


Lys 


Phe 


lie 


Trp 


Pro 


Thr 
fin 


Leu 


Val 


Thr 


Thr 


Arg 


Tyr 


Pro 


Asp 


rid. s 


Met- 
ric L. 


Arg 

O \J 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 


on 














Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 


Leu 


Gly 


His 
140 


Lys 


Leu 


Glu 


Tyr 


Met 


Ala 

155 


J\sp 


Lys 


Gin 


Lys 


A sr. 
160 


His 


Asn 


lie 


Glu 


Asp 


Gly 


Ser 


170 










175 




Asn 


Thr 


Pro 


He 


Glv 
190 


Asp 


Gly 


Leu 


Ser 


Tyr 


Gin 
205 


Ser 


Ala 


Leu 


His 


Met 


Val 
220 


Leu 


Leu 


Glu 


Phe 


Met 


Asp 
235 


Glu 


Leu 


Tyr 


Lys 





Thr 


Gly 


Val 


val 


Pro 


He 


Leu 


10 










15 




His 


Arg 


Phe 


Ser 


Val 


Ser 


Gly 










30 






Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


lie 






45 








Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 






60 










Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 




75 










80 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 


90 










95 




Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 










110 






Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 








125 








Leu Gly 


His 


Lys 


Leu 


Glu 


Tyr 






140 










Thr 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 




155 










160 


His 


Asn 


lie 


Glu 


Asp 


Gly 


Ser 


170 










175 




Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 



190 
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Pro Val 


Leu 


Leu 


Pro Asp Asn 


His 




195 










200 


Ser Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


210 










215 




Val Thr 


Ala 


Ala 


Gly 


lie 


Thr 


Leu 


225 








230 






<210> 11 












<211> 604 












<212> PRT 












<213> Artificial Sequence 




<220> 














<223> Mouse 


MCHlR-linker-EGFP 


<400> 11 












Met Asp 


Leu 


Gin 


Ala 


Ser 


Leu 


Leu 


1 






5 








He Ser 


Asp 


G 1 v 


Gin 


ASD 


Asn 


Phe 






20 










Thr Arg 


Ser 


Val 


Ser 


Tyr 


He 


Asn 




35 










40 


Thr lie 


Cys 


Leu 


Leu 


Gly 


He 


Val 


50 










55 




Val Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


65 








70 






Phe He 


He 


Asn 


Leu 


Ser 


Val 


Val 








85 








Pro Phe 


Met 


He 


His 


Gin 


Leu 


Met 






100 










Glu Thr 


Met 


Cvs 


Thr 


Leu 


He 


Thr 




115 










120 


Thr Ser 


Thr 


nn v >- 
j — 


He 


Leu 


Thr 


Ala 


130 








135 




Thr Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


145 








150 






Thr Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 








165 








Pro Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 






180 










Gly Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 




195 










200 


Thr Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


210 










215 




Thr Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


225 








230 






Pro Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 








245 








Thr Ala 


He 


Ala 


He 


Cys 


Leu 


Val 






260 










Tyr Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


275 










280 


Phe Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


290 










295 




Cys Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


305 








310 






Arg Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 








325 








Val Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 






340 










Thr Val 


Asp 


Gly 


Thr 


Ala 


Gly 


Pro 




355 










360 



Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 










205 








Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 








220 










Gly Met Asp 


Glu 


Leu 


Tyr 


Lys 




















Ser 


Thr 


Gly 


Pro 


Asn 


/iia 


Ser 


Asn 




10 










Id 




Thr 


Leu 


Ala 


Gly 


Pro 


Pro 


Pro 


Arg 


25 










3 0 






He 


He 


Met 


Pro 


Ser 


Val 


pne 


bly 


















Gly Asn 


Ser 


Thr 


Val 


lie 


Phe 


Ala 








60 










Trp 


Cys 


Ser 


Asn 


Val 


Pro 


Asp 


He 






75 










80 


Asp 


Leu 


Leu 


Phe 


Leu 


Leu 


Gly 


Met 




90 










9d 




Gly Asn Gly 


Val 


Trp 


His 


Pne 


Gly 


105 










110 






Ala 


Met 


Asp 


Ala 


Asn 


Ser 


Gin 


Phe 










125 








Met 


Ala 


He 


Asp 


Arg 


Tyr 


Leu 


Ala 








140 










Lys 


Phe 


Arg 


Lys 


Pro 


Ser 


Met 


Ala 






155 










160 


Ala 


Leu 


Ser 


Phe 


lie 


Ser 


-Lie 


Thr 




170 










1 10 




He 


Pro 


Phe 


Pro 


Gly 


Gly 


Ala 


Val 


185 










iy u 






Pro 


Asp 


Thr 


Asp 


Leu 


Tyr 


Trp 


Pne 










one 
20b 








Phe 


Ala 


Leu 


Pro 


Phe 


Vai 


Val 


He 








220 










Gin Arg Met 


Thr 


Ser 


Ser 


Val 


Ala 






235 










240 


Leu Arg 


Thr 


Lys 


Arg 


Val 


Thr 


Arg 




250 










255 




Phe 


Phe 


Val 


Cys 


Trp 


Ala 


Pro 


Tyr 


265 










270 






Ser 


He 


Ser 


Arg 


Pre 


Thr 


Leu 


Thr 










285 








He 


Ser 


Leu 


Gly 


Tyr 


Al a 


Asn 


Ser 








300 










Val 


Leu 


Cys 


Glu 


Thr 


Phe 


Arg 


Lys 






315 










•320 


Ala 


Ala 


Gin 


Gly 


Gin 


Leu 


Arg 


Thr 




330 










335 




Glu 


Glu 


Arg 


Thr 


Glu 


Ser 


Lys 


Gly 


345 










350 






Gly 


Ser 


He 


Ala 


Thr 


Met 


Val 


Ser 



365 
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Lys 


Gly 
370 


Glu 


Glu 


Leu 


Phe 


Thr 
375 


Gly 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


385 










390 






Gly Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 










405 








Gly Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 








420 










Gly Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 






435 










440 


Phe 


Phe 
450 


Lys 


Ser 


Ala 


Met 


Pro 
455 


Glu 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 


Asn 


Tyr 


465 










470 






Glu 


Gly 


Asp 


Thr 


Leu 
485 


Val 


Asn 


Arg 


Lys 


Glu 


Asp 


Gly 
500 


Asn 


He 


Leu 


Gly 


Ser 


His 


Asn 
515 


Val 


Tyr 


He 


Met 


Ala 
520 


Val 


Asn 
530 


Phe 


Lys 


He 


Arg 


His 
535 


Asn 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


545 








550 






Leu 


Pro 


Asp 


Asn 


His 
565 


Tyr 


Leu 


Ser 


Pro 


Asn 


Glu 


Lys 
580 


Arg 


Asp 


His 


Met 


Ala 


Gly 


He 
595 


Thr 


Leu 


Gly 


Met 


Asp 
600 



<210> 12 
<211> 592 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mouse MCH1R/EGFP 



<400> 12 



Met 
1 


Asp 


Leu 


Gin 


Ala 
5 


Ser 


Leu 


Leu 


lie 


Ser 


Asp 


Gly 


Gin 


Asp 


Asn 


Phe 








20 










Thr 


Arg 


Ser 


Val 


Ser 


Tyr 


lie 


Asn 






35 










40 


Thr 


He 


Cys 


Leu 


Leu 


Gly 


lie 


Val 




50 










55 




Val 


Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


65 










70 






Phe 


lie 


lie 


Asn 


Leu 


Ser 


Val 


Val 










85 








Pro 


Phe 


Met 


lie 


His 


Gin 


Leu 


Met 








100 










Glu 


Thr 


Met 


Cys 


Thr 


Leu 


lie 


Thr 






115 










120 


Thr 


Ser 


Thr 


Tyr 


lie 


Leu 


Thr 


Ala 




130 










135 




Thr 


Val 


His 


Pro 


lie 


Ser 


Ser 


Thr 


145 










150 






Thr 


Leu 


Val 


lie 


Cys 


Leu 


Leu 


Trp 



165 



Val 


Val 


Pro 


lie 


Leu 


Val 


Glu 


Leu 








380 










Phe 


Ser 


Val 


Ser 


Gly 


Glu 


Gly 


Glu 






395 










400 


Thr 


Leu 


Lys 


Phe 


lie 


Cys 


Thr 


Thr 




410 










415 




Thr 


Leu 


Val 


Thr 


Thr 


Leu 


Thr 


Tyr 


425 










430 






Pro 


Asp 


His 


Met 


Lvs 


Gin 


His 


Asp 










445 








Gly Tyr 


Val 


Gin 


Glu 


Arg 


Thr 


He 








460 










Lys 


Thr 


Arg 


Ala 


Glu 


Val 


Lys 


Phe 






475 










480 


lie 


Glu 


Leu 


Lys 


Gly 


He 


Asp 


Phe 




490 










495 




His 


Lys 


Leu 


Glu 


Tyr 


Asn 


Tyr 


Asn 


505 










510 






Asp 


Lys 


Gin 


Lys 


Asn 


Gly 


lie 


Lys 










525 








lie 


Glu 


Asp 


Gly 


Ser 


Val 


Gin 


Leu 








540 










Pro 


He 


Gly 


Asp 


Gly 


Pro 


Val 


Leu 






555 










560 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 


Lys 


Asp 




570 










575 




Val 


Leu 


Leu 


Glu 


Phe 


Val 


Thr 


Ala 


585 










590 






Glu 


Leu 


Tyr 


Lys 










Ser 


Thr 


Gly 


Pro 


Asn 


Ala 


Ser 


Asn 




10 










15 




Thr 


Leu 


Ala 


Gly 


Pro 


Pro 


Pro 


Arg 


25 










30 






lie 


lie 


Met 


Pro 


Ser 


Val 


Phe 


Gly 










45 








Gly Asn 


Ser 


Thr 


Val 


He 


Phe 


Ala 








60 










Trp 


Cys 


Ser 


Asn 


Val 


Pro 


Asp 


lie 






75 










80 


Asp 


Leu 


Leu 


Phe 


Leu 


Leu 


Gly 


Met 




90 










95 




Gly Asn 


Gly 


Val 


Trp 


His 


Phe 


Gly 


105 










110 






Ala 


Met 


Asp 


Ala 


Asn 


Ser 


Gin 


.Phe 










125 








Met 


Ala 


He 


Asp 


Arg 


Tyr 


Leu 


Ala 








140 










Lys 


Phe 


Arg 


Lys 


Pro 


Ser 


Met 


Ala 






155 










160 


Ala 


Leu 


Ser 


Phe 


He 


Ser 


He 


Thr 




170 










175 
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Pro 


Val 


Trp 


Leu 


Tyr 


Aia 


Arg Leu 






1 on 
1 0 U 










Gly Cys 


Gly 


lie 


Arg 


Leu 


Pro 


Asn 






19b 










200 


Thr 


Leu 


Tyr 


bin 


irne 


pne 


Leu 


Ala 




210 








215 




Thr 


Ala 


Ala 


Tyr 


val 


Lys 


He 


Leu 


225 










OTA 






Pro 


Ala 


Ser 


Gin 


Arg 


ber 


He 


Arg 










O / t; 








Thr 


Ala 


lie 


Ala 


lie 


Cys 


Leu 


Val 








O £ C\ 










Tyr 


Val 


Leu 


bin 


j_ieu 


inr 


Gin 


Leu 




275 










280 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 




290 










295 




Cys 


Leu 


Asn 


Pro 


pne 


vai 


Tyr 


He 


305 










"3 T A 






Arg 


Leu 


Val 


Leu 


ber 


vai 


Lys 


Pro 


















Val 


Ser 


Asn 


Ala 


bin 


Thr 


Ala 


Asp 








s a r\ 
5 40 










Thr 


Met 


Val 


Ser 


Lys 


biy 


Glu 


Glu 






T C C 










360 


Leu 


Val 


pi,, 
GlU 


Leu 


Asp 


biy 


Asp Val 




370 










375 




Gly Glu 


Gly 


bXU 


biy 


Asp 


Ala 


Thr 


385 










*3 O A 






lie 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 










/l A E 








Thr 


Leu 


Tnr 


Tyr 


biy 


vai 


Gin Cys 








A O A 










Lys 


Gin 


His 


Asp 


-n't, ^ 

pne 


Pne 


Lys 


Ser 




435 










440 


Glu 


Arg 


Tnr 


lie 


Pne 


pne 


Lys 


Asp 




450 










455 




Glu 


Val 


Lys 


pne 


blU 


Gly 


Asp 


Thr 


465 










/ "1 A 

4/0 






Gly 


He 


Asp 


Pne 


Lys 


\J J. LL 


Asp Gly 










485 








Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 








500 










Asn 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 






515 










520 


Ser 


Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 




530 










535 




Gly 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


545 










550 






Leu 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 










565 








Phe 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 



580 

<210> 13 
<211> 604 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> MCHlR-linker-EGFP 
<400> 13 



Tie t)rn 

xxe xrro 


rile 


Pro Gly Gly 


riXci 


val 










190 






rxO ASp 


inr 


Asp 


Leu 


Tyr 


Trp 


trfie 








205 








riie ax a 


Leu 


Pro 


Phe 


Val 


vax 


lie 






220 










Gin Arg 


WcL 


Thr 


Ser 


Ser 


vax 


Ala 




J 










£t *± v 


Lieu Arg 


TVl"v 
1 LI*. 


Lys 


Arg Val 


± LIZ. 


Arg 


OCA 














Phe Phe 


vax 


Cys 


Trp Ala 


Pro 


Tyr 


0 £ 
~ 0 0 








270 






Ser He 


Ser 


Arg 


Pro 


Thr 


Leu 


lill 








285 








lie Ser 


Leu 


Gly Tyr Ala 


Asn 


Cor 






300 










Val Leu 


Cys 


Glu 


Thr 


Phe 


Arg 


Lys 




315 












Ala Ala 


Gin 


Gly Gin Leu 


Arg 


1 ILL 


330 










"5 "7 c: 

j j j 




Glu Glu 


Arg 


Thr 


Glu 


Ser 


Lys 


bxy 


345 








350 






Leu Phe 


Thr 


Gly Val 


Val 


Pro 


lie 








365 








Asn Gly His 


Lys 


Phe 


Ser 


vax 


Ser 






380 










Tyr Gly Lys 


Leu 


Thr 


Leu 


Lys 






395 












Val Pro 


Trp 


Pro 


Thr 


Leu 


vai 


Thr 
1 ill. 


410 










/lie 




Phe Ser Arg 


Tyr 


Pro 


Asp 


nlS 


jyxe u 


425 








430 






Ala Met 


Pro 


Glu Gly Tyr 


vax 


bxn 








445 








Asp Gly Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 

ax a 






460 










Leu Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 




475 










/ion 


Asn He 


Leu 


Gly 


His 


Lys 


Leu 


pin 
bXU 


490 










4 i7 !? 




Tyr He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


505 








510 






He Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 








525 








Gin Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 






540 










His Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 




555 










560 


Arg Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


570 










575 




Leu Gly Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


585 








590 
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Met Asp 


Leu 


Glu 


Ala 


Ser 


Leu 


Leu 


Pro Thr Gly Pro 


Asn 


Aia 


Ser 


Asn 


1 






5 










10 








15 




Thr Ser 


Asp 


Gly 


Pro 


Asp 


Asn 


Leu 


Thr 


Ser Ala Gly 


Ser 


Pro 


Pro Arg 




20 










25 








30 






Thr Gly 


Ser 


He 


Ser 


Tyr 


Tl a 

lie 


Asn 


He 


lie Met 


Pro 


Ser 


Val 


Phe 


Gly 


35 










a a 








40 








Thr lie 


Cys 


Leu 


Leu 


Gly 


lie 


lie 


Gly Asn Ser 


Thr 


vai 


lie 


Phe 


Ala 


50 








c c 
oo 








60 










Val Val 




Lys 


Ser 


Lys 


Leu 


His 


Trp Cys Asn 


Asn 


Val 


Pro 


Asp 


He 


65 






h a 








75 










80 


Phe lie 


lie 


Asn 


Leu 


Ser 


vai 


vai 


Asp 


Leu Leu 


Phe 


Leu 


Leu 


Gly Met 








O Q 

OD 










90 








95 




Pro Pne 




lie 


HI S 


um 


Leu 


rie t. 


Gly Asn Gly Val 


Trp 


His 


Phe 


Gly 






~t a a 
1 UU 










105 








110 






Glu Thr 


Met 


Cys 


Thr 


Leu 


_ie 


Thr 


Ala Met Asp Ala 


Asn 


Ser 


Gin 


Phe 




115 










lz u 








i oc 

1ZD 








Thr Ser 


Thr 


Tyr 


Tl- 
IlS 


Leu 


Thr 


Til - 

Ala 


Met 


Ala lie Asp 


Arg 


Tyr 


Leu 


Ala 


130 








135 








140 










Thr Val 


His 


Pro 


Tl- 


Ser 


Ser 


Thr 


Lys 


Phe Arg 


Lys 


Pro 


Ser 


Met 


Ala 


145 








150 








155 










160 


Thr Leu 


Val 


He 


Cys 
165 


Leu 


Leu 


Trp 


Ala 


Leu Ser 
170 


Phe 


Tl - 

lie 


Ser 


lie 
175 


Thr 


Pro Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 


He 


Pro Phe 


Pro 


Gly 


Gly Ala Val 






180 










185 








190 






Gly Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


Pro Asp Thr Asp 


Leu 


Tyr 


Trp 


Phe 




195 










200 








205 








Thr Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


Phe 


Ala Leu 


Pro 


Pne 


Val 


Val 


lie 


210 








215 








220 










Thr Ala 


Ala 


Tyr 


Val 


Lys 


T 1 « 

lie 


Leu 


Gin 


Arg Met 


Thr 


Ser 


Ser 


Val 


Ala 


225 






230 








235 










240 


Pro Ala 


Ser 


Gin 


Arg 


Ser 


t1 - 

He 


Arg 


Leu 


Arg Thr Lys 


Arg 


Val 


Thr 


Arg 








245 










250 








255 




Thr Ala 


He 


Ala 


He 


Cys 


Leu 


Val 


Phe 


Phe Val 


Cys 


Trp 


Ala 


Pro Tyr 






260 










265 








270 






Tyr Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


lie Ser Arg 


Pro 


Thr 


Leu 


Thr 


275 










280 








285 








Phe Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He 


Ser Leu Gly 


Tyr 


Ala 


Asn 


Ser 


290 










295 








300 










Cys Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


Val 


Leu Cys 


Glu 


Thr 


Phe 


Arg 


Lys 


305 








310 








315 










320 


Arg Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala Gin Gly 


Gin 


Leu 


Arg 


Thr 








325 










330 








335 




Val Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 


Glu Arg 


Thr 


Glu 


Ser 


Lys 


Gly 






340 










345 








350 




Thr Val 


ASD 
355 


Gly 


Thr 


Aia 


Gly 


Pro 
3 6 0 


Gly 


Ser He 


Ala 


Thr 
365 


Met 


Val 


Ser 


Lys Gly Glu 


G1U 


Leu 


pne 


Thr 


Gly 


Val 


Vai Pro 


lie 


Leu 


Val 


Glu 


Leu 


370 










375 






380 










Asp Gly Asp 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser Val 


Ser 


Gly Glu Gly Glu 


385 








n n n 








395 










400 


Gly Asp Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu Lys 


Phe 


lie 


Cys 


Thr 


Thr 








405 










410 








415 




Gly Lys 


Leu 


Pro 
420 


Val 


Pro 


Trp 


Pro 


Thr 
425 


Leu Val 


Thr 


Thr 


Leu 

430 


Thr 


Tyr 


Gly Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp His 


Met 


Lys 


Gin 


His 


Asp 




435 










440 








445 






Phe Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly Tyr Val 


Gin 


Glu 


Arg 


Thr 


lie 


450 










455 








460 










Phe Phe 


Lys 


Asp 


Asp 


Gly 


Asn Tyr 


Lys 


Thr Arg 


Ala 


Glu 


Val 


Lys 


Phe 


465 








470 








475 










480 


Glu Gly Asp 


Thr 


Leu 


Val 


Asn Arg 


He 


Glu Leu Lys 


Gly 


He 


Asp 


Phe 








485 










490 








495 
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Lys 


Glu 


Asp 


Gly 


Asn 


lie 


Leu 


Gly 




500 










Ser 


His 


Asn 
515 


Val 


Tyr 


lie 


Met 


Ala 
520 


Val 


Asn 
530 


Phe 


Lys 


lie 


Arg 


His 
535 


Asn 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


545 








550 






Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 








565 








Pro 


Asn 


Glu 


Lys 
580 


Arg 


Asp 


His 


Met 


Ala 


Gly 


lie 


Thr 


Leu 


Gly 


Met 


Asd 




595 










600 



<210> 14 
<21i> 673 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> MCHlR-iinker-EGFP 



<400> 14 



Met 


Ser 


Val 


Gly 


Ala 

5 


Met 


Lys 


Lys 


1 

Gly 


Gly 


Gly 


Ser 


Gly 


Cys 


Gin 


Ala 






20 










Cys 


Gly 


Ala 


Cys 


Ala 


Pro 


Gly 


Gin 




35 










40 


Gin 


Pro 


Ala 


Trp 


Val 


Glu 


Gly 


Ser 




50 










55 




Thr 


Gly 


Thr 


Gly 


Trp 


Met 


Asp 


Leu 


65 










70 






Pro 


Asn 


Ala 


Ser 


Asn 


Thr 


Ser 


Asp 










85 








Gly 


Ser 


Pro 


Pro 


Arg 


Thr 


Gly 


Ser 






100 










Pro 


Ser 


Val 


Phe 


Gly 


Thr 


He 


Cys 






115 










120 


Thr 


Val 


He 


Phe 


Ala 


Val 


Val 


Lys 




130 










135 




Asn 


Val 


Pro 


Asp 


He 


Phe 


He 


He 


145 










150 






Phe 


Leu 


Leu 


Gly 


Met 


Pro 


Phe 


Met 










165 








Val 


Trp 


His 


Phe 


Gly 


Glu 


Thr 


Met 








180 










Ala 


Asn 


Ser 


Gin 


Phe 


Thr 


Ser 


Thr 






195 










200 


Asp 


Arg 


Tyr 


Leu 


Ala 


Thr 


Val 


His 




210 










215 




Lys 


Pro 


Ser 


Met 


Ala 


Thr 


Leu 


Val 


225 










230 






Phe 


lie 


Ser 


He 


Thr 


Pro 


Val 


Trp 










245 








Pro 


Gly 


Gly 


Ala 


Val 


Gly 


Cys 


Gly 








260 










Asp 


Leu 


Tyr 


Trp 


Phe 


Thr 


Leu 


Tyr 






275 










280 


Pro 


Phe 


Val 


Val 


He 


Thr 


Ala 


Ala 




290 










295 





His 


Lys 


Leu 


Glu 


Tyr- 


Asn 


Tyr 


Asn 


505 










510 






Asp 


Lys 


Gin 


Lys 


Asn 


Gly 


lie 


Lys 










525 








He 


Glu 


Asp 


Gly 


Ser 


Val 


Gin 


Leu 








540 










Pro 


He 


Gly 


Asp 


Gly 


Pro 


Val 


Leu 






555 










560 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 


Lys 


Asp 




570 










575 




Val 


Leu 


Leu 


Glu 


Phe 


Val 


Thr 


Ala 


585 










590 








Leu 


Tyr 


Lys 










Gly 


Val 


Gly 


Arg 


Ala 


Val 


Gly 


Leu 




10 










lb 




Thr 


Glu 


Glu 


Asp 


Pro 


Leu 


Pro 


Asn 


25 
















Gly 


Gly 


Arg 


Arg 


Trp 


Arg 


T m * 

Leu 


Pro 










45 








Ser 


Ala 


Arg 


Leu 


Trp 


Glu 


Gin 


Ala 








tl A 

60 










Glu 


Ala 


Ser 


Leu 


Leu 


Pro 


Thr 


Gly 






75 










Q A 


Gly 


Pro 


Asp 


Asn 


Leu 


ml, „ 

Tnr 


Ser 


Ala 




90 










y d 




lie 


Ser 


Tyr 


lie 


Asn 


lie 


lie 


Met 


105 










1 1 A 
110 






Leu 


Leu 


Gly 


He 


lie 


Gly 


Asn 


Ser 










125 








Lys 


Ser 


Lys 


Leu 


His 


Trp 


Cys 


Asn 








140 










Asn 


Leu 


Ser 


val 


val 


Asp 


Leu 


Leu 






155 










1 C A 

lbU 


He 


His 


Gin 


Leu 


Met 


Gly 


Asn 


Gly 




170 










175 




Cys 


Thr 


Leu 


He 


Thr 


Ala 


Met 


Asp 


185 










190 






Tyr 


Tip 

± ± fcr 


Leu 


L ll-L 


nla 


Mpt- 


Ala 


He 








205 








Pro 


lie 


Ser 


Ser 


Thr 


Lys 


Phe 


Arg 








220 










lie 


Cys 


Leu 


Leu 


Trp 


Ala 


Leu 


Ser 




235 










240 


Leu 


Tyr 


Ala 


Arg 


Leu 


lie 


Pro 


Phe 




250 










255- 




lie 


Arg 


Leu 


Pro 


Asn 


Pro 


Asp 


Thr 


265 








270 






Gin 


Phe 


Phe 


Leu 


Ala 


Phe 


Ala 


Leu 










285 








Tyr 


Val 


Lys 


He 


Leu 


Gin 


Arg 


Met 



300 
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Thr 


Ser 


Ser 


Val 


Ala 


Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 


Leu 


Arg 


Thr 


305 










310 










315 










320 


Lys 


Arg 


Val 


Thr 


Arg 


Thr 


Ala 


He 


Ala 


He 


Cys 


Leu 


Val 


Phe 


Phe 


Val 






325 










330 










335 




Cys 


Trp 


Ala 


Pro 


Tyr 


Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


He 


Ser 




340 










345 










350 






Arg 


Pro 


Thr 


Leu 


Thr 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He 


Ser 


Leu 




355 










360 










365 








Gly 


Tyr 


Ala 


Asn 


Ser 


Cys 


Leu 


Asn 


Pro 


Phe 


Vax 


Tyr 


He 


Val 


Leu 


Cys 


370 










375 










380 










Glu 


Thr 


Phe 


Arg 


Lys 


Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 


385 






390 










395 










400 


Gly Gin 


Leu 


Arg 


Thr 


Val 


Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 


Glu 


Arg 








405 










410 










415 




Thr 


Glu 


Ser 


Lys 
420 


Gly 


Thr 


Val 


Asp 


Gly 
425 


Thr 


Ala 


Gly 


Pro 


Gly 
430 


Ser 


He 


Ala 


Thr 


Met 
435 


Val 


Ser 


Lys 


Gly 


Glu 
440 


Glu 


Leu 


Phe 


Thr 


Gly 
445 


Val 


Val 


Pro 


He 


Leu 
450 


Val 


Glu 


Leu 


Asp 


Gly 
455 


Asp 


Val 


Asn 


Gly 


His 
460 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


465 








470 










475 










480 


Phe 


He 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Vai 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 








485 










490 










495 




Thr 


Thr 


Leu 


Thr 


Tyr 


Gly 


Vai 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 








500 








505 










510 






Met 


Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 




515 










520 










525 








Gin 


Glu 
530 


Arg 


Thr 


He 


Phe 


Phe 
535 


Lys 


Asp 


Asp 


Gly 


Asn 
540 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


545 








550 










555 










560 


Lys 


Gly 


lie 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 








565 










570 










575 




Glu 


Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 






580 










585 










590 






Lys 


Asn 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 




595 










600 










605 








Gly 


Ser 
610 


Val 


Gin 


Leu 


Ala 


Asd 
615 


His 


Tyr 


Gin 


Gin 


Asn 
620 


Thr 


Pro 


He 


Gly 


Asp 


Gly 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


625 










630 










635 










640 


Ala 


Leu 


Ser 


Lys 


Aso 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 








645 










650 










655 




Glu 


Phe 


Val 


Thr 


Ala 


Ala 


Gly 


lie 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 



660 665 670 



Lys 



<210> 15 
<211> 1269 
<212> DNA 
<213> Human 

<400> 15 

atgtcagtgg gagccatgaa gaagggagtg gggagggcag ttgggcttgg aggcggcagc 60 

ggctgccagg ctacggagga agaccccctt cccaactgcg gggcttgcgc tccgggacaa 120 

ggtggcaggc gctggaggct gccgcagcct gcgtgggtgg aggggagctc agctcggttg 180 

tgggagcagg cgaccggcac tggctggatg gacctggaag cctcgcngct gcccactggt 240 

cccaacgcca gcaacacctc tgatggcccc gataacctca cttcggcagg atcacctcct 300 

cgcacgggga gcatctccta catcaacatc atcatgcctt cggtgttcgg caccatctgc 360 

ctcctgggca tcatcgggaa ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg 42 0 
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cactggtgca 
tttctcctgg 
ggggagacca 
tacatcctga 
acgaagttcc 
ttcatcagca 
gtgggctgcg 
cagtttttcc 
ctgcagcgca 
aagagggtga 
tactatgtgc 
ttatacaatg 
atcgtgctct 
gggcagcttc 
ggcacctga 

<210> 16 
<211> 1062 
<212> DNA 
<213> Human 



acaacgtccc 
gcatgccctt 
tgtgcaccct 
ccgccatggc 
ggaagccctc 
tcacccctgt 
gcatacgcct 
tggcctttgc 
tgacgtcctc 
cccgcacagc 
tacagctgac 
cggccatcag 
gtgagacgtt 
gcgctgtcag 



cgacatcttc 
catgatccac 
catcacggcc 
cattgaccgc 
tgtggccacc 
gtggctgtat 
gcccaaccca 
cctgcctttt 
agtggccccc 
catcgccatc 
ccagttgtcc 
cttgggctat 
ccgcaaacgc 
caacgctcag 



atcatcaacc 
cagctcatgg 
atggatgcca 
tacctggcca 
ctggtgatcc 
gccagactca 
gacactgacc 
gtggtcatca 
gcctcccagc 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtcctgt 
acggctgacg 



tctcggtag- 
gcaatggggt 
atagtcagtt 
ctgtccaccc 
gcctcctgtg 
tccccttccc 
tctactggtt 
cagccgcata 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaaccc 
cggtgaagcc 
aggagaggac 



agatctccrc 
gtggcactcc 
caccagcacc 
cat c tec tec 
ggccctctcc 
aggaggtgea 
caccctgtac 
cgtgaggatc 
getgeggaca 
ctgggcaccc 
ctttgtctac 
ctttgtgtac 
tgcagcccag 
agaaagcaaa 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1269 



<400> 16 

atggacctgg 

cccgataacc 

ateatcatge 

gtcatcttcg 

ttcatcatca 

caccagctca 

gccatggatg 

cgctacctgg 

accctggtga 

tatgecagae 

ccagacactg 

tttgtggtca 

cccgcctccc 

atctgtctgg 

tccatcagcc 

tatgecaaca 

cgcttggtcc 

cagaeggctg 



aagcctcgct 
tcacttcggc 
cttcggtgtt 
cggtcgtgaa 
acctctcggt 
tgggcaatgg 
ccaatagtca 
ccactgtcca 
tctgcctcct 
tcatccccct 
acctctactg 
tcacagccgc 
agegcagcat 
tcttctttgt 
gcccgaccct 
gctgcctcaa 
tgtcggtgaa 
acgaggagag 



gctgcccact 
aggatcacct 
cggcaccatc 
gaagtccaag 
agtagatctc 
ggtgtggcac 
gttcaccagc 
ccccatctct 
gtgggccctc 
cccaggaggt 
gttcaccctg 
ataegtgagg 
ccggctgcgg 
gtgctgggca 
cacctttgtc 
cccctttgtg 
gcctgcagcc 
gacagaaagc 



ggtcccaatg 
cctcgcacgg 
tgcctcctgg 
ctgcactggt 
ctctttctcc 
tttggggaga 
acctacatcc 
tccacgaagt 
tccttcatca 
gcagtgggct 
taccagtttt 
atcctgcagc 
acaaagaggg 
ccctactatg 
tacttataca 
tacatcgtgc 
caggggcagc 
aaaggcacct 



ccagcaacac 
ggagcatctc 
geatcategg 
geaacaaegt 
tgggcatgee 
ccatgtgcac 
tgaccgccat 
tccggaagcc 
gcatcacccc 
gcggcatacg 
tcctggcctt 
gcacgacgtc 
tgacccgcac 
tgetacaget 
atgeggecat 
tctgtgagac 
ttcgcgctgt 
ga 



ctctgatggc 
ctacatcaac 
gaactccacg 
ccccgacatc 
cttcatgatc 
cctcatcacg 
ggecattgae 
Ctctgtggcc 
tgtgtggctg 
cctgcccaac 
tgccctgcct 
ctcagtggcc 
agccatcgcc 
gacccagttc 
cagcttgggc 
gttccgcaaa 
cagcaacgct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1062 



<210> 17 
<211> 2080 
<212> DNA 
<213> Mouse 



<220> 

<221> misc_f eature 
<222> (1) . . . (2080) 
<223> n = A,T,C or G 



<400> 17 

ggeggtagag gaagaccctt ttctggactg eggggctcaa gctccggaca aggcggtgga 60 

gggcgctgga ggctgccgca gcctgcgtgg gtggacgggc gctccactcc agggagcagg 120 

cgacctgcac eggctgeatg gatctgeaag cctcgttgct gtccactggc cccaatgcca 180 

gcaacatctc cgatggccag gataatttca cattggcggg gccacctcct cgcacaagga 240 

gtgtctccta catcaacatc ateatgeett cagtgtttgg taccatctgt ctcctgggca 3 00 

ttgtgggaaa ctccacagtc atttttgecg tggtgaagaa atccaagctg cactggtgca 3 60 

gcaacgtccc tgacatcttc atcatcaacc tctctgtggt ggatctgett ttcctgctgg 420 

geatgecttt catgatccac cagctcatgg gtaatggtgt ctggcacttt ggggaaacca 480 

tgtgcaccct catcacagcc atggacgcca acagtcagtt caccagcacc tacatcctga 540 

ctgctatggc cattgaccgc tacttggeca ccgtccatcc catctcctcc accaagttcc 600 
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ggaagcccuc catggccacc ctggtgatct gcctcctgtg qgczczczcg ctca'cagca 660 

tcactcctgt gtggctctat gccaggctta tccccttccc agggggtgct gtgggctgtg 720 

gcatccgcct accaaaccca gatactgatc tttactggtt caccctgtat cagtttctcc 780 

tggccttcgc ccttccgttt gcggtcatca ctgctgcgta cgtgaaaana ctacagcgca 840 

tgacgtcttc ggcggcccca gcctctcaac gcagcatccg gcttcggaca aagagggtga 900 

cccgcacagc cattgccatc tgtctggtct tctttgtgtg ctgggcgccc tactacgtgc 960 

tgcagctgac ccagttgtcc atcagccgcc cgaccctcac attcgtctac ctgtacaatg 1020 

cggccatcag cttgggctat gccaacagct gcctcaatcc ctttgtgtac atagtactct 1080 

gtgagacctt tcgaaaacgc ttggtgctgt cggtgaagcc cgcggcccag gggcagcttc 1140 

gcacggtcag caatgctcag acagctgacg aggagaggac agaaagcaaa ggcacctgac 1200 

aatccccccc ggtcacctcc aagtcaggtc accgcatcaa accatgggga gagaractga 12 60 

gataaacccg gggctaccct gggaggatgc agaagctgga ggctgggggc ttgtagcaaa 1320 

ccacattcca cggggcccac aaattgctag ggaggcttgc agcctggrtt gggggggaag 1380 

cctcagactg cagggatccc cttgacagaa tagaagcgga gcaagaagga aagggtggtt 1440 

tgactggttc tcggggtctg tatctgttgg ctcgcatata tctttctctc aagggaagaa 1500 

ggcggaggtg cctagctggg ttcctttaaa actaggcagg gctaggatct gagcagctag 1560 

ggctctactg tgagactggg caagccgagc gttccctccc atctctcatt ggtgttgata 1620 

gaaggcagtc tttctcccaa gctggtggat ctcctgaagc acgctgcctg ggctccagca 1680 

tcctgtgcgg atttcacgtt ctctttaggg gatgcatgtt gacactgggg tgtgggctct 1740 

gagcccacag gagtttaaaa aaccaaaaga gctcagagtg tcgagagaga cccaatcacc 1800 

gagaatgaca aggcaacctg gggtggatgt ggatcttgaa actaataaaa aggggttttc 1860 

acagtgacag cgacattctc ttcatagggc acagctgtca gtctatggct gatccagagc 1920 

gagcatccat gaattctgca tgtgcagggg tcactctaat acctgatatg ttggcatcat 1980 

ctttgtgctt gagccttccn ctcccaaatg ggaatgaaat aaaggcaaat tcccnccccc 2040 

cccaaaaaag gggnaaaaaa aaaaaaaaaa aaaaaaaaaa 2 080 

<210> 18 
<211> 3357 
<212> DNA 
<213> Mouse 

<220> 

<221> misc_f eature 
<222> (1) . . . (3357) 
<223> n = A,T,C or G 

<400> 18 

ggcggtagag gaagaccctt ttctggactg cggggctcaa gctccggaca aggcggtgga 60 

gggcgctgga ggctgccgca gcctgcgtgg gtggacgggc gctccactcc agggagcagg 120 

cgacctgcac cggctgcatg gatctgcaag cctcgttgct gtccactggc cccaatgcca 180 

gcaacatctc cgatggccag gataatttca cattggcggg tgagtcgagt tggagtcctc 240 

cctcctccgg gatgggtgtg gaaaatggga aggtttcacc tcccaagcca aactgcctgg 300 

gaaactttat cttacagttc ttggtgataa gatctgcagt cggctttgcc tgaagaggaa 360 

gaggagagga ggggacacca gctaggacag aaggggcagg gaggaataga gatggggcag 420 

aggcacattt agaaacaaca agggttggtg acaagacgtg aggcaggctt gaggggaaag 480 

cttgctgatg agtcccaaat atgctttgca gggggggggg ggggggaatc aaggctggag 540 

aagcaagcaa gcaagacagc aagacagcgg gcgggtagta tgtgggagcc agcagaagcg 600 

ctttgattca ccgctatcct gggctcaatc ctctggcctc gcactgggga aatggggtct 660 

gagtggtcct tgctgtcttc tggcaaaggc tgctgggagc aaaagacttc acagggcgtg 720 

agaggattaa cttttctggt gaattaagct tcttgacatt tgcagaacgt caatgcctta 780 

aaattctagc tctgaaggag aagggaatga aggggaaaga gggaaggttg gtgtggagaa 840 

attcccaagc ttctggggtg taacacagct ccagtcccta ccctattggg aaagcccaga 900 

ctcaggagac atggtccaag gaaatccctg acagaaaacc gggagagggc agggctgtgg 960 

agcctgaaac acaccccaca cccatggtga cagtcacttc tcacatatgc ctaggaacct 1020 

atctgaaacc tttggccatc tctctctgaa aagatgaggc tgcaaataca cacacacaca 1080 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca aatgticcttc 1140 

aagccttttt gacaaggttt tctggtggat cccggggata tgaagttgtt ctcagcagat 12 00 

atctgggagt cttgactcct ggccctctga ctaaatggat gaagcgaaga agaatggggt 1260 

cctctgagta acaggtggat ctagaaaatc ctataggagt caccagggca cggtggagga 1320 

gggtaaggta cagaactaac aatagcccga gaaggggaaa cagcaggaga tgattccaga 13 80 

gacgtagtga ccccaagctg caagggaaac catgaggggc cagcaggaag gccgacatgg 1440 

caggttgtca gcttctagat cggaaggcgg gtcacacttg ctctttctat cctcagggcc 1500 
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acctcctcgc acaaggagtg tctcctacat caacatcatc atgccttcag tgtttggnac 156 0 

catctgtctc ctgggcattg tgggaaactc cacagtcatt tttgccgtgg tgaagaaatc 1620 

caagctgcac tggtgcagca acgtccctga catcttcatc atcaacctct ctgtggtgga 1680 

tctgcttttc ctgctgggca tgcctttcat gatccaccag ctcatgggta atggtgtctg 1740 

gcactttggg gaaaccatgt gcaccctcat cacagccatg gacgccaaca gtcagttcac 1800 

cagcacctac atcctgactg ctatggccat tgaccgctac ttggccaccg tccaccccat 1860 

ctcctccacc aagttccgga agccctccat ggccaccctg gtgatctgcc tcctgtgggc 1920 

tctctcgttc artagcatca ctcctgtgtg gctctatgcc aggcttatcc ccttcccagg 1980 

gggtgctgtg ggctgtggca tccgcctacc aaacccagat actgatcttt actggttcac 2040 

tctgtatcag tttttcctgg ccttcgccct tccgtttgtg gtcatcactg ctgcgtacgt 2100 

gaaaatacta cagcgcatga cgtcttcggt ggccccagcc tctcaacgca gcatccggct 2160 

tcggacaaag agggtgaccc gcacagccar tgccatctgt ctggtcttct ttgtgtgctg 2220 

ggcgccctac tacgtgctgc agctgaccca gttgtccatc agccgcccga ccctcacatt 22 8 0 

cgcctacctg tacaatgcgg ccatcagctt gggctatgcc aacagctgcc tcaatccctt 2340 

tgtgtacata gtactctgtg agacctttcg aaaacgcttg gtgctgtcgg tgaagcccgc 2400 

ggcccagggg cagcttcgca cggtcagcaa tgctcagaca gctgacgagg agaggacaga 2460 

aagcaaaggc acctgacaat cccccccggt cacctccaag tcaggtcacc gcatcaaacc 252 0 

atggggagag atactgagat aaacccgggg ctaccctggg aggatgcaga agctggaggc 2580 

tgggggcttg tagcaaacca cattccacgg ggcccacaaa ttgctaggga ggcttgcagc 2640 

ctggtttggg ggggaagcct cagactgcag ggatcccctt gacagaatag aagcggagca 2700 

agaaggaaag ggtggtttga ctggttctcg gggtctgtat ctgttggctc gcatatatct 2760 

ttctctcaag ggaagaaggc ggaggtgcct agctgggttc ctttaaaact aggcagggct 2820 

aggatctgag cagctagggc tctactgtga gactgggcaa gccgagcgtt ccctcccatc 2880 

tctcattggt gttgatagaa ggcagtcttt ctcccaagct ggtggatctc ctgaagcacg 2940 

ctgcctgggc tccagcatcc tgtgcggatt tcacgttctc tttaggggat gcatgttgac 3 000 

actggggtgt gggctctgag cccacaggag tttaaaaaac caaaagagct cagagtgtcg 3 060 

agagagaccc aatcaccgag aatgacaagg caacctgggg tggatgtgga tcttgaaact 3120 

aataaaaagg ggttttcaca gtgacagcga cattctcttc atagggcaca gctgtcagtc 3180 

tatggctgat ccagagcgag catccatgaa ttctgcatgt gcaggggtca ctctaatacc 3240 

tgatatgttg gcatcatctt tgtgcttgag ccttccnctc ccaaatggga atgaaataaa 33 00 

ggcaaattcc cncccccccc aaaaaagggg naaaaaaaaa aaaaaaaaaa aaaaaaa 33 57 

<210> 19 
<211> 1062 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human short form/mouse species chimeric MCH1R 
<400> 19 

atggacctgg aagcctcgct gctgcccact ggtcccaatg ccagcaacac ctctgatggc 60 

cccgataacc tcacttcggc aggatcacct cctcgcacgg ggagcatctc ctacatcaac 120 

atcatcatgc cttcggtgtt cggcaccatc tgcctcctgg gcatcatcgg gaactccacg 180 

gtcatcttcg cggtcgtgaa gaagtccaag ctgcactggt gcaacaacgt ccccgacatc 240 

ttcatcatca acctctcggt agtagatctc ctctttctcc tgggcatgcc cttcatgatc 300 

caccagctca tgggcaatgg ggtgtggcac tttggggaga ccatgtgcac cctcatcacg 3 60 

gccatggatg ccaatagtca gttcaccagc acctacatcc tgaccgccat ggccattgac 420 

cgctacctgc ccactgtcca ccccatctct tccacgaagt tccggaagcc ctccatggcc 480 

accctggtga tctgcctcct gtgggctctc tcgttcatta gcatcactcc tgtgtggctc 540 

tatgccaggc ttatcccctt cccagggggt gctgtgggct gtggcatccg cctaccaaac 600 

ccagatactg atctttactg gttcactctg tatcagtttt tcctggcctt cgcccttccg 660 

tttgtggtca tcactgctgc gtacgtgaaa atactacagc gcatgacgtc ttcggtggcc 720 

ccagcctctc aacgcagcat ccggcttcgg acaaagaggg tgacccgcac agccattgcc 780 

atctgtctgg tcttctttgt gtgctgggcg ccctactacg tgctgcagct gacccagttg 840 

tccatcagcc gcccgaccct cacattcgtc tacctgtaca atgcggccat cagcttgggc 900 

tatgccaaca gctgcctcaa tccctttgtg tacatagtac tctgtgagac ctttcgaaaa 960 

cgcttggtgc tgtcggtgaa gcccgcggcc caggggcagc ttcgcacggt cagcaatgct 1020 

cagacagctg acgaggagag gacagaaagc aaaggcacct ga 1062 

<210> 20 
<21i> 1269 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human long form/mouse species chimeric MCH1R 



<400> 20 

atgtcagtgg 

ggctgccagg 

ggtggcaggc 

tgggagcagg 

cccaacgcca 

cgcacgggga 

ctcctgggca 

cactggtgca 

tttctcctgg 

ggggagacca 

tacatcctga 

acgaagttcc 

ttcattagca 

gtgggcrgtg 

cagtttttcc 

ctacagcgca 

aagagggtga 

tactacgtgc 

ctgracaatg 

atagtactct 

gggcagcttc 

ggcacctga 



gagccatgaa 
ctacggagga 
gctggaggct 
cgaccggcac 
gcaacacctc 
gcatctccta 
tcatcgggaa 
acaacgtccc 
gcatgccctt 
tgtgcaccct 
ccgccatggc 
ggaagccctc 
tcactcctgt 
gcatccgcct 
tggccttcgc 
tgacgtcttc 
cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 



gaagggagtg 
agaccccctt 
gccgcagcct 
tggctggatg 
tgatggcccc 
catcaacatc 
ctccacggtc 
cgacatcttc 
catgatccac 
catcacggcc 
cattgaccgc 
catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 
cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 



gggagggcag 
cccaactgcg 
gcgtgggtgg 
gacctggaag 
gataacctca 
atcatgcctt 
atcttcgcgg 
atcatcaacc 
cagctcatgg 
atggatgcca 
tacctggcca 
ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 



ttgggcttgg 
gggcttgcgc 
aggggagctc 
cctcgctgct 
cttcggcagg 
cggtgttcgg 
tcgtgaagaa 
tctcggtagt 
gcaatggggt 
atagtcagtt 
ctgtccaccc 
gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 



aggcggcagc 
tccgggacaa 
agctcggttg 
gcccactggt 
atcacctcct 
caccatctgc 
gtccaagctg 
agatctcctc 
gtggcacttt 
caccagcacc 
catctcttcc 
ggctctctcg 
agggggtgct 
cactctgtat 
cgtgaaaata 
gcttcggaca 
ctgggcgccc 
attcgtctac 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1269 



<210> 21 
<211> 966 
<212> DNA 

<213> Aeguorea Victoria 



<400> 21 

tacacacgaa 

cccaattctt 

gggtgaaggt 

actacctgtt 

aagataccca 

tgtacaggaa 

caagtttgaa 

agatggaaac 

catggcagac 

agatggaagc 

tgtcctttta 

cgaaaagaga 

catggatgaa 

attactaaaa 

agattcatta 

caaacaggct 

ctcaaa 



taaaagataa 
gttgaattag 
gatgcaacat 
ccatggccaa 
gatcatatga 
agaactatat 
ggtgataccc 
attcttggac 
aaacaaaaga 
gttcaactag 
ccagacaacc 
gaccacatgg 
ctatacaaat 
tctcagggtt 
aaattgtatg 
acttggagtg 



caaagatgag 
atggtgatgt 
acggaaaact 
cacttgtcac 
aacagcatga 
ttttcaaaga 
ttgttaatag 
acaaattgga 
atggaatcaa 
cagaccatta 
attacctgtc 
tccttcttga 
aaatgtccag 
cctggttaaa 
aataatttat 
tattcttaat 



taaaggagaa 
taatgggcac 
tacccttaaa 
tactttctct 
ctttttcaag 
tgacgggaac 
aatcgagtta 
atacaactat 
agttaacttc 
tcaacaaaat 
cacacaatct 
gtttgtaaca 
acttccaatt 
ttcaggctga 
tgatgttatt 
tctatattaa 



gaacttttca 
aaattttctg 
tttatttgca 
tatggtgttc 
agtgccatgc 
tacaagacac 
aaaggtattg 
aactcacaca 
aaaattagac 
actccaattg 
gccctttcga 
gctgctggga 
gacactaaag 
gatattattt 
gatagaggtt 
ttacaatttg 



ctggagttgt 
tcagtggaga 
ctactggaaa 
aatgcttttc 
ccgaaggtta 
gtgctgaagt 
attttaaaga 
atgtatacat 
acaacattga 
gcgatggccc 
aagatcccaa 
ttacacatgg 
tgtccgaaca 
atatatttat 
attrtcttat 
atttgacttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
966 



<210> 22 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
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<400> 22 

gtcgacggta ccgcgggccc gggatccatc gccaccatgg tgagcaaggg cgaggagctg 60 

ttcaccgggg tggtgcccat cctggtcgag ctggacggcg acgtaaacgg ccacaagttc 120 

agcgtgtccg gcgagggcga gggcgatgcc acctacggca agctgaccct gaagttcatc 180 

tgcaccaccg gcaagctgcc cgtgccctgg cccaccctcg tgaccaccct gacctacggc 240 

gtgcagtgct tcagccgcta ccccgaccac atgaagcagc acgacttctt caagtccgcc 300 

atgcccgaag gctacgtcca ggagcgcacc atcttcttca aggacgacgg caactacaag 3 60 

acccgcgccg aggtgaagtt cgagggcgac accctggtga accgcatcga gctgaagggc 420 

atcgactcca aggaggacgg caacatcctg gggcacaagc tggagtacaa ctacaacagc 480 

cacaacgtct atatcatggc cgacaagcag aagaacggca tcaaggtgaa cttcaagatc 540 

cgccacaaca tcgaggacgg cagcgtgcag ctcgccgacc actaccagca gaacaccccc 600 

atcggcgacg gccccgtgct gctgcccgac aaccactacc tgagcaccca gtccgccctg 66 0 

agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc tggagttcgt gaccgccgcc 720 

gggatcactc tcggcatgga cgagctgtac aagtaaagcg gccgc 765 

<210> 23 
<211> 720 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> G?P derivative 
<400> 23 

atggtgagca agggcgagga 
ggcgacgtaa acggccacaa 
ggcaagctga ccctgaagtt 
ctcgtgacca ccttgaccta 
cagcacgact tcttcaagtc 
ttcaaggacg acggcaacta 
gtgaaccgca tcgagctgaa 
aagctggagt acaactacaa 
ggcatcaagg tgaacttcaa 
gaccactacc agcagaacac 
tacctgagca cccagtccgc 
ctgctggagt tcgtgaccgc 

<210> 24 
<211> 720 
<2i2> DNA 
<213> Artificial Sequence 

<220> 

<223> GFP derivative 
<400> 24 

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 

ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120 

ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 

ctcgtgacca ccttcggcta cggcgtgcag tgcttcgccc gctaccccga ccacatgcgc 240 

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 

gtgaaccgca tcgagctgaa gggcatcgac ctcaaggagg acggcaacat cctggggcac 420 

aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480 

ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540 

gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 

tacctgagct accagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 720 

<210> 25 
<211> 720 
<212> DNA 



gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 

gttcagcgtg tccggcgagg gcgagggcga tgccacctac 12 0 

catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 

cggcgtgcag tgcttcgccc gctaccccga ccacatgaag 240 

cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 

caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 

gggcatcgac ttcaaggagg acggcaacat cctggggcac 420 

cagccacaag gtctatatca ccgccgacaa gcagaagaac 480 

gacccgccac aacatcgagg acggcagcgt gcagctcgcc 540 

ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 

cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 

cgccgggatc actctcggca tggacgagct gtacaagtaa 720 
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<213> Artificial Sequence 
<220> 

<223> GFP derivative 



<400> 25 

atggtgagca 

ggcgacgtaa 

ggcaagctga 

ctcgtgacca 

cagcacgac: 

ttcaaggacg 

gtgaaccgca 

aagctggagt 

ggcatcaagg 

gaccactacc 

tacctgagca 

ctgctggagt 



agggcgagga 
acggccacag 
ccctgaagtc 
ccctgacctg 
tcttcaagtc 
acggcaacta 
tcgagctgaa 
acaactacat 
cccacttcaa 
agcagaacac 
cccagtccgc 
tcgtgaccgc 



gctgttcacc 
gttcagcgtg 
catctgcacc 
gggcgtgcag 
cgccatgccc 
caagacccgc 
gggcatcgac 
cagccacaac 
gatccgccac 
ccccatcggc 
cctgagcaaa 
cgccgggatc 



ggggtggtgc 

tccggcgagg 
accggcaagc 
tgcttcagcc 
gaaggctacg 
gccgaggtga 
ttcaaggagg 
gtctatatca 
aacatcgagg 
gacggccccg 
gaccccaacg 
actctcggca 



ccatcctggt 
gcgagggcga 
tgcccgtgcc 
gctaccccga 
tccaggagcg 
agttcgaggg 
acggcaacat 
ccgccgacaa 
acggcagcgt 
tgctgctgcc 
agaagcgcga 
tggacgagct 



cgagctggac 
tgccacctac 
ctggcccacc 
ccacatgaag 
taccatcttc 
cgacaccctg 
cctggggcac 
gcagaagaac 
gcagctcgcc 
cgacaaccac 
tcacatggtc 
gtacaagtaa 



<210> 26 
<2il> 3092 
<212> DNA 

<213> Artificial Sequence 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



<220> 

<223> Mouse MCH1R-1 inker- EGFP 



<400> 26 

atggatctgc aagcctcgtt gctgtccact ggccccaatg ccagcaacat ctccgatggc 60 
caggataatt tcacattggc gggtgagtcg agttggagtc ctccctcctc cgggatgggt 120 
gtggaaaatg ggaaggtttc acctcccaag ccaaactgcc tgggaaactt tatcttacag 180 
ttcttggtga taagatctgc agtcggcttt gcctgaagag gaagaggaga ggaggggaca 240 
ccagctagga cagaaggggc agggaggaat agagatgggg cagaggcaca tttagaaaca 3 00 

acaagggttg gtgacaagac gtgaggcagg cttgagggga aagcttgctg atgagtccca 3 60 

aatatgcttc gcaggggggg ggggggggga atcaaggctg gagaagcaag caagcaagac 420 
agcaagacag cgggcgggta gtatgtggga gccagcagaa gcgctttgat tcaccgctat 480 
cctgggctca atcctctggc ctcgcacrgg ggaaatgggg tctgagtggt ccttgctgtc 540 
ttctggcaaa ggctgctggg agcaaaagac ttcacagggc gtgagaggat taacttttct 600 
ggtgaattaa gcttcttgac atttgcagaa cgtcaatgcc ttaaaattct agctctgaag 660 
gagaagggaa tgaaggggaa agagggaagg ttggtgtgga gaaattccca agcttctggg 720 
gtgtaacaca gctccagtcc ctaccctatt gggaaagccc agactcagga gacatggtcc 780 
aaggaaatcc ctgacagaaa accgggagag ggcagggctg tggagcctga aacacacccc 840 
acacccatgg tgacagtcac ttctcacata tgcctaggaa cctatctgaa acctttggcc 900 
atctctctct gaaaagatga ggctgcaaat acacacacac acacacacac acacacacac 960 

acacacacac acacacacac acacacacac acaaatgtcc ttcaagcctt tttgacaagg 1020 

ttttctggtg gatcccgggg atatgaagtt gttctcagca gatatctggg agtcttgact 1080 

cctggccctc tgagtaaatg gatgaagcga agaagaatgg ggtcctctga gtaacaggtg 1140 

gatctagaaa atcctatagg agtcaccagg gcacggtgga ggagggtaag gtacagaact 12 00 

aacaatagcc cgagaagggg aaacagcagg agatgattcc agagacgtag tgaccccaag 12 60 

ctgcaaggga aagcatgagg ggccagcagg aaggccgaca tggcaggttg tcagcttcta 132 0 

gatcggaagg cgggtcacac ttgctctttc tatcctcagg gccacctcct cgcacaagga 13 80 

gtgtctccta catcaacatc atcatgcctt cagtgtttgg taccatctgt ctcctgggca 1440 

ttgtgggaaa ctccacagtc atttttgccg tggtgaagaa atccaagctg cactggtgca 1500 

gcaacgtccc tgacatcttc atcatcaacc tctctgtggt ggatctgctt ttcctgctgg 1560 

gcatgccttt catgatccac cagctcatgg gtaatggtgt ctggcacttt ggggaaacca 1620 

tgtgcaccct catcacagcc atggacgcca acagtcagtt caccagcacc tacatcctga 1680 

ctgctatggc cattgaccgc tacttggcca ccgtccatcc catctcctcc accaagttcc 1740 

ggaagccctc catggccacc ctggtgatct gcctcctgtg ggctctctcg ttcattagca 1800 

tcactcctgt gtggctctat gccaggctta tccccttccc agggggtgct gtgggctgtg 1860 

gcatccgcct accaaaccca gatactgatc tttactggtt cactctgtat cagtttttcc 1920 

tggccttcgc ccttccgttt gtggtcatca ctgctgcgta cgtgaaaata ctacagcgca 1980 

tgacgtcttc ggtggcccca gcctctcaac gcagcatccg gcttcggaca aagagggtga 2040 
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cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
acggtaccgc 
ccggggtggt 
tgtccggcga 
ccaccggcaa 
agtgcttcag 
ccgaaggcta 
gcgccgaggt 
acttcaagga 
acgtctatat 
acaacatcga 
gcgacggccc 
aagaccccaa 
tcactctcgg 



cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
gggcccggga 
gcccarcctg 
gggcgagggc 
gctgcccgtc 
ccgctacccc 
cgtccaggag 
gaagttcgag 
ggacggcaac 
catggccgac 
ggacggcagc 
cgtgctgctg 
cgagaagcgc 
catggacgag 



tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
tccatcgcca 
gtcgagctgg 
gatgccacct 
ccctggccca 
gaccacatga 
cgcaccatct 
ggcgacaccc 
atcccggggc 
aagcagaaga 
gtgcagctcg 
cccgacaacc 
gatcacatgg 
ctgtacaagt 



tctttgtgtg ctgggcgccc tactacgtgc 
cgaccctcac attcgzctac ctgtacaatg 
gcctcaatcc ctttgtgtac acagtacccc 
cggtgaagcc cgcggcccag gggcagcttc 
aggagaggac agaaagcaaa ggcaccgtcg 
ccatggtgag caagggcgag gagctgttca 
acggcgacgt aaacggccac aagttcagcg 
acggcaagct gaccctgaag ttcatctgca 
ccctcgtgac caccctgacc tacggcgtgc 
agcagcacga cttcttcaag tccgccatgc 
tcttcaagga cgacggcaac tacaagaccc 
tggtgaaccg catcgagctg aagggcatcg 
acaagctgga gtacaactac aacagccaca 
acggcatcaa ggtgaacttc aagatccgcc 
ccgaccacta ccagcagaac acccccatcg 
actacctgag cacccagtcc gccctgagca 
tcctgctgga gttcgtgacc gccgccggga 
aa 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3092 



<210> 27 
<211> 3056 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Mouse MCH1R/EGF? direct fusion 
<400> 27 

atggatctgc aagccucgtt gctgtccact ggccccaatg ccagcaacat ctccgatggc "60 

caggataatt tcacattggc gggtgagtcg agttggagtc ctccctcctc cgggatgggt 120 

gtggaaaatg ggaaggtttc acctcccaag ccaaactgcc tgggaaactc tatcttacag 180 

ttcttggtga taagatctgc agtcggcttt gcctgaagag gaagaggaga ggaggggaca 240 

ccagctagga cagaaggggc agggaggaat agagatgggg cagaggcaca tttagaaaca 3 00 

acaagggttc gtgacaagac gtgaggcagg cttgagggga aagcttgctg atgagtccca 3 60 

aatatgcttt gcaggggggg ggggggggga atcaaggctg gagaagcaag caagcaagac 420 

agcaagacag cgggcgggta gtatgtggga gccagcagaa gcgctttgat tcaccgctat 480 

cctgggctca atcctctggc cncgcactgg ggaaatgggg tctgagtggt ccntgctgtc 540 

ttctggcaaa ggctgctggg agcaaaagac ttcacagggc grgagaggar taacttttct 600 

ggtgaattaa gcttcttgac atttgcagaa cgtcaatgcc ttaaaattct agctctgaag 660 

gagaagggaa tgaaggggaa agagggaagg ttggtgtgga gaaattccca agcttctggg 720 

gtgtaacaca gctccagtcc ctaccctatt gggaaagccc agactcagga gacatggtcc 780 

aaggaaatcc ctgacagaaa accgggagag ggcagggctg tggagcctga aacacacccc 840 

acacccatgg tgacagtcac ttctcacata tgcctaggaa cctatctgaa acctttggcc 900 

atctctctct gaaaagatga ggctgcaaat acacacacac acacacacac acacacacac 960 

acacacacac acacacacac acacacacac acaaatgtcc ttcaagcctt tttgacaagg 1020 

ttttctggtg gatcccgggg atatgaagtt gttctcagca gatatctggg agtcttgact 1080 

cctggccctc tgagtaaatg gatgaagcga agaagaatgg ggtcctctga gtaacaggtg 1140 

gatctagaaa atcctatagg agtcaccagg gcacggtgga ggagggtaag gtacagaact 1200 

aacaatagcc cgagaagggg aaacagcagg agatgattcc agagacgtag tgaccccaag 1260 

ctgcaaggga aagcatgagg ggccagcagg aaggccgaca tggcaggttg tcagcttcta 1320 

gatcggaagg cgggtcacac ttgctctttc tatcctcagg gccacctcct cgcacaagga 1380 

gtgtctccta catcaacatc atcatgcctt cagtgtttgg taccatctgt ctcctgggca 1440 

ttgtgggaaa ctccacagtc atttttgccg tggtgaagaa atccaagctg cactggtgca 1500 

gcaacgtccc tgacatcttc atcatcaacc tctctgtggt ggatctgctt ttcctgctgg 1560 

gcatgccttt catgatccac cagctcatgg gtaatggtgt ctggcacttt ggggaaacca 1620 

tgtgcaccct catcacagcc atggacgcca acagtcagtt caccagcacc tacatcctga 1680 

ctgctatggc cattgaccgc tacttggcca ccgtccatcc catctcctcc accaagttcc 1740 

ggaagccctc catggccacc ctggtgatct gcctcctgtg ggctctctcg ttcattagca 1800 

tcactcctgt gtggctctat gccaggctta tccccttccc agggggtgct gtgggctgtg 1860 

gcatccgcct accaaaccca gatactgatc tttactggtt cactctgtat cagtttttcc 1920 

tggccttcgc ccttccgttt gtggtcatca ctgctgcgta cgtgaaaata ctacagcgca 1980 

tgacgtcttc ggtggcccca gcctctcaac ccagcatccg gcttcggaca aagagggtga 2040 
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cccgcacagc cattgccatc tgtctggtct tctttgtgtg ctgggcgccc tactacgtgc 2100 

tgcagctgac ccagttgtcc atcagccgcc cgaccctcac attcgtctac ctgtacaatg 2160 

cggccatcag cttgggctat gccaacagct gcctcaatcc ctttgtgtac atagtactct 2220 

gtgagacctt tcgaaaacgc ttggtgctgt cggtgaagcc cgcggcccag gggcagcttc 2280 

gcacggtcag caatgctcag acagctgacg aggagaggac agaaagcaaa ggcaccatgg 2340 

tgagcaaggg cgaggagctg ttcaccgggg tggtgcccat cctggtcgag ctggacggcg 2400 

acgtaaacgg ccacaagttc agcgtgtccg gcgagggcga gggcgatgcc acctacggca 2460 

agctgaccct gaagttcatc tgcaccaccg gcaagctgcc cgtgccctgg cccaccctcg 2520 

tgaccaccct gacctacggc gtgcagtgct tcagccgcta ccccgaccac atgaagcagc 2580 

acgacttctt caagtccgcc atgcccgaag gctacgtcca ggagcgcacc atcttcttca 2640 

aggacgacgg caactacaag acccgcgccg aggtgaagtt cgagggcgac accctggtga 2700 

accgcatcga gctgaagggc atcgacttca aggaggacgg caacatcctg gggcacaagc 27 60 

tggagtacaa ctacaacagc cacaacgtct atatcatggc cgacaagcag aagaacggca 2820 

tcaaggtgaa cttcaagatc cgccacaaca tcgaggacgg cagcgtgcag ctcgccgacc 2880 

actaccagca gaacaccccc atcggcgacg gccccgtgct gctgcccgac aaccactacc 2940 

tgagcaccca gtccgccctg agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc 3 000 

tggagttcgt gaccgccgcc gggatcactc tcggcatgga cgagctgtac aagtaa 3 05 6 



<210> 28 
<211> 1815 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human short form/mouse species chimeric 
MCHlR-linker-EGFP 



<400> 28 

atggacctgg aagcctcgct gctgcccact ggtcccaatg ccagcaacac ctctgatggc .60 

cccgataacc tcacttcggc aggatcacct cctcgcacgg ggagcatctc ctacatcaac 120 

atcatcatgc cttcggtgtt cggcaccatc tgcctcctgg gcatcatcgg gaactccacg 180 

gtcatcttcg cggtcgtgaa gaagtccaag ctgcactggt gcaacaacgt ccccgacatc 240 

ttcatcatca acctctcggt agtagatctc ctctttctcc tgggcatgcc cttcatgatc 300 

caccagctca tgggcaatgg ggtgtggcac tttggggaga ccatgtgcac cctcatcacg 3 60 

gccatggatg ccaatagtca gttcaccagc acctacatcc tgaccgccat ggccattgac 420 

cgctacctgg ccactgtcca ccccatctct tccacgaagt tccggaagcc ctccatggcc 480 

accctggtga tctgcctcct gtgggctctc tcgttcatta gcatcactcc tgtgtggctc 540 

tatgccaggc ttatcccctt cccagggggt gctgtgggct gtggcatccg cctaccaaac 600 

ccagatactg atctttactg gttcactctg tatcagtttt tcctggcctt cgcccttccg 660 

tttgtggtca tcactgctgc gtacgtgaaa atactacagc gcatgacgtc ttcggtggcc 720 

ccagcctctc aacgcagcat ccggcttcgg acaaagaggg tgacccgcac agccattgcc 780 

atctgtctgg tcttctttgt gtgctgggcg ccctactacg tgctgcagct gacccagttg 840 

tccatcagcc gcccgaccct cacattcgtc tacctgtaca atgcggccat cagcttgggc 900 

tatgccaaca gctgcctcaa tccctttgtg tacatagtac tctgtgagac ctttcgaaaa 960 

cgcttggtgc tgtcggtgaa gcccgcggcc caggggcagc ttcgcacggt cagcaatgct 1020 

cagacagctg acgaggagag gacagaaagc aaaggcaccg tcgacggtac cgcgggcccg 108 0 

ggatccatcg ccaccatggt gagcaagggc gaggagctgt tcaccggggt ggtgcccatc 1140 

ctggtcgagc tggacggcga cgtaaacggc cacaagttca gcgtgtccgg cgagggcgag 1200 

ggcgatgcca cctacggcaa gctgaccctg aagttcatct gcaccaccgg caagctgccc 1260 

gtgccctggc ccaccctcgt gaccaccctg acctacggcg tgcagtgctt cagccgctac 1320 

cccgaccaca tgaagcagca cgacttcttc aagtccgcca tgcccgaagg ctacgtccag 1380 

gagcgcacca tcttcttcaa ggacgacggc aactacaaga cccgcgccga ggtgaagttc 1440 

gagggcgaca ccctggtgaa ccgcatcgag ctgaagggca tcgacttcaa ggaggacggc 1500 

aacatcctgg ggcacaagct ggagtacaac tacaacagcc acaacgtcta tatcatggcc 1560 

gacaagcaga agaacggcat caaggtgaac ttcaagatcc gccacaacat cgaggacggc 1620 

agcgtgcagc tcgccgacca ctaccagcag aacaccccca tcggcgacgg ccccgtgctg 1680 

ctgcccgaca accactacct gagcacccag tccgccctga gcaaagaccc caacgagaag 1740 

cgcgatcaca tggtcctgct ggagttcgtg accgccgccg ggatcactct cggcatggac 1800 

gagctgtaca agtaa 1815 



<210> 29 
<211> 2022 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human long form/mouse species chimeric 
MCH1R- 1 inker-EGFP 

<400> 29 

atgtcagtgg gagccatgaa gaagggagtg gggagggcag ttgggcttgg aggcggcagc 60 

ggctgccagg ctacggagga agaccccctt cccaactgcg gggcttgcgc tccgggacaa 120 

ggtggcaggc gctggaggct gccgcagcct gcgtgggtgg aggggagctc agctcggttg 180 

tgggagcagg cgaccggcac tggctggatg gacctggaag cctcgctgct gcccactggt 240 

cccaacgcca gcaacacctc tgatggcccc gataacctca cttcggcagg atcacctcct 3 00 

cgcacgggga gcatctccta catcaacatc atcatgcctt cggtgttcgg caccatctgc 360 

ctcctgggca tcatcgggaa ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg 420 

cactggtgca acaacgtccc cgacatcttc atcatcaacc tctcggtagt agatctcctc 480 

tttctcctgg gcatgccctt catgatccac cagctcatgg gcaatggggt gtggcacttt 540 

ggggagacca tgtgcaccct catcacggcc atggatgcca atagtcagtt caccagcacc 600 

tacatcctga ccgccatggc cattgaccgc tacctggcca ctgtccaccc catctcttcc 660 

acgaagttcc ggaagccctc catggccacc ctggtgatct gcctcctgtg ggctctctcg 720 

ttcattagca tcactcctgt gtggctctat gccaggctta tccccttccc agggggtgct 780 

gtgggctgtg gcatccgcct accaaaccca gatactgatc tttactggtt cactctgtat 840 

cagtttttcc tggccttcgc ccttccgttt gtggtcatca ctgctgcgta cgtgaaaata 900 

ctacagcgca tgacgtcttc ggtggcccca gcctctcaac gcagcatccg gcttcggaca 960 

aagagggtga cccgcacagc cattgccatc tgtctggtct tctttgtgtg ctgggcgccc 1020 

tactacgtgc tgcagctgac ccagttgtcc atcagccgcc cgaccctcac attcgtctac 1080 

ctgtacaatg cggccatcag cttgggctat gccaacagct gcctcaatcc ctttgtgtac 1140 

atagtactct gtgagacctt tcgaaaacgc ttggtgctgt cggtgaagcc cgcggcccag 12 00 

gggcagcttc gcacggtcag caatgctcag acagctgacg aggagaggac agaaagcaaa 1260 

ggcaccgtcg acggtaccgc gggcccggga tccatcgcca ccatggtgag caagggcgag 1320 

gage tgt tea ccggggtggt gcccatcctg gtcgagctgg aeggegaegt aaacggccac 13 80 

aagttcagcg tgtceggega gggegaggge gatgccacct aeggcaaget gaccctgaag 1440 

ttcatctgea ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc 1500 

tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag 1560 

tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac 1620 

tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg 1680 

aagggcatcg actccaagga ggaeggcaac atcctggggc acaagctgga gtacaactac 1740 

aacagccaca aegtctatat catggccgac aagcagaaga aeggcatcaa ggtgaacttc 18 00 

aagatccgcc acaacatcga ggaeggcage gtgeagcteg ccgaccacta ccagcagaac 1860 

acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc 1920 

gccctgagca aagaccccaa egagaagege gatcacatgg tcctgctgga gttcgtgacc 1980 

gccgccggga tcactctcgg catggacgag ctgtacaagt aa 2022 

<210> 30 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Linker 
<400> 30 

tegaeggtae cgcgggcccg ggatccatcg ccacc 

<210> 31 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker 



35 



-23- 



WO 01/68706 



PCT/USU 1/08071 



<400> 31 

Val Asd Gly Thr Ala Gly Pro Gly Ser He Ala Thr 
15 10 

<210> 32 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 32 

gcgaattcac catggatctg caagcctcg 29 

<210> 33 
<21i> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 33 

gcgtcgacgg tgcctttgct ttctgtcc 28 

<210> 34 
<211> 33 
<212> DNA 

<2I3> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 34 

ccttgctcac catggtgcct ttgctttctg tec 33 

<210> 35 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 35 

cagaaagcaa aggcaccatg gtgagcaagg gegaggage 39 

<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 36 

ggeggatect etagagtege ggee 24 

<210> 37 
<211> 38 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 37 

gctctagagt cgcggccgct tacttgtaca gctcgtcc 3 8 
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